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(57) Abstract: Nature evolves biological 
molecules such as proteins thzx}Ugh iterated 
rounds of diversification, selection, and 
amplification. The present invention pro- 
vides methods, compositions, and systems 
for syntheiszing, selecting, amplifying, and 
evolving non-natural molecules based on 
nucleic acid templates, the sequence of a 
nucleic acid template is used to direct the 
synthesis of non-natural molecules such 
as unnatural polymers and small molecules. 
Using this method combinatorial libraries 
of these molecules can be prepared and 
screened. Upon selection of a molecule, 
its encoding nucleic acid template may 
be amplified and/or evolved to yield the 
same molecule of the present invention 
allow for the amplification and evolution of 



non-natural tnolecules in a manner analogous to the amplification of natural biopblymer such as poIynucleoUdes and protein. 
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EvoLvmc New Mou:cxJLAR Function 

Priority Information 
[00011 This application claims priority under 35 U.S.C, § 119(e) to U.S. Provisional patent 
applications 60/277,081, filed March 19, 2001, entitled '"Nucleic Acid Directed Synthesis of 
Chemical Compounds":. 60/277,094, filed March 19, 2001, entitled "Approaches to Generating 
New Molecular Function"; and 60/306,691, filed July 20, 2001, entitled "Approaches to 
Generating New Molecular Function", and the entire contents of each of these applications are 
hereby incorporated by reference. 

BACKGROON0 OF THE Invention 
[00021 The classic "chemical approach" to generating molecules with new fimctions has been 
used extensively over the last century in applications ranging fi:om drug discovery to synthetic 
methodology to materials science. In this approach (Fig. U black), researchers synthesize or 
isolate candidate molecules, assay these candidates for desired properties, determine the 
structures of active compounds if unknown, formulate structure-activity relationships based on 
the assay and structural data, and then synthesize a new generation of molecules designed to 
possess improved properties. While combinatorial , chemistry methods (see, for example. A- V. 
Eliseev and J. M. Lehn. Combinatorial Chemistry In Biology 1999, 243, 159-172; K. W. Kuntz, . 
M. L. Snapper and A. H. Hoveyda. Current Opinion in Chemical Biology 1999, 5, 313-319; D. 
R. Liu and P. G. Schultz. Angew. Chem, Intl. Ed Eng 1999, 38, 36) have increased the 
throughput of this approach, its fundamental limitations remmn unchanged. Several factors limit 
tlie effectiveness of the chemical approach to generating molecular fimction. First, our abUity to 
accurately predict the structural changes that will lead to new fimction is often inadequate due to 
subtle conformational rearrangemeats of molecules, unforeseen solvent interactions, or unknown 
stereochemical requirements of binding or reaction events. The resulting complexity of 
structure-activity relationships frequently limits the success of rational ligand or catalyst design, 
including those efforts conducted in a high-throughput maimer. Second, the need to assay or 
screen, rather than select, each member of a collection of candidates limits the number of 
molecules that can be searched in each experiment. Finally, the lack of a way to amplify 
synthetic, molecules places requirements on the minimum amomit of material that must be 
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produced for characterization, screening, and structure elucidation. As a result, it can be difficult 
to generate libraries of more than roughly 10^ different synthetic compounds. 
[00031- In contrast. Nature generates proteins with new functions using a fundamentally 
different method that overcomes many of these limitations. In this approach (Fig. 1, gray), a 
protein with desired properties induces the survival and amplification of the information 
encoding tliat protein. This information is diversified through spontaneous mutation and DNA 
recombination, and . then translated into a new generation . of candidate proteins using the ^ 
ribosome. The power of this process is well appreciated {see, F. Arnold Acc. Chem. Res, 1998, 
31, 125; F.H. Arnold etaL Curr. Opin. Chem. Biol 1999, 3, 54-59; J. Minshull etal Cum Opift 
Chem, BioL 1999, i, 284-90) and is evidenced by the fact that proteins and nucleic acids 
dominate the solutions to many complex chemical problems despite their limited chemical 
ftinctionality. Clearly, unlike the linear chemical approach described above, the steps used by 
Nature form a cycle of molecular evolutioiL Proteins emerging from this process have been 
directly selected, rather than simply screened, for desired activities. Because the information 
encoding evolving proteins ODNA) can be amplified, a single protein molecule with desired 
activity can in theory lead to the survival and propagation of the DNA encoding its structure. 
The vanishirigly small amounts of material needed to participate in a cycle of molecular 
evolution allow libraries much larger in diversity than those synthesized by chemical approaches 
to be generated- and selected for desired function in small volumes. 

[0004] Acknowledging the power and efficiency of Nature's approach, researchers have used 
molecular evolution to generate many protems and nucleic acids with novel binding or catalytic 
properties (see, for example, J. Mmshull et al Ciirr, Opin. Chem. Biol 1999, J, 284-90; C. 
Schmidt-Dannert et al Trends Biotechnol 1999, 17, 135-6; D. S. Wilson et al AnniL Rev. 
Biochem. 1999, 68, 611-47). Proteins and nucleic acids evolved by researchers have 
demonstrated value as research tools, diagnostics, industrial res^ents, and therapeutics and have 
greatly expanded our understanding of the molecular interactions that endow proteins and 
nucleic acids with binding or catalytic properties {see, M. Famulok et al Curr. Opin. Chem. Biol 
1998,2,320-7). 

[0005] Despite nature's efficient approach to generating function, nature's molecular 
evolution i^ limited to two types of "natural" molecules — proteins and nucleic acids-7 because 
thus far the information in DNA can only be translated into proteins or into other nucleic acids. 
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However, many synthetic molecules of interest do not in general represent nucleic acid 
backbones, and the use of DNA-templated synthesis to translate DNA sequences into synthetic 
small molecules would be broadly useful only if syntlietic molecules other than nucleic acids and 
nucleic acid analogs could be synthesized in a DNA-templated fashion. An ideal approach to 
generating functional molecules would merge the most powerful aspects of molecular evolution 
with the flexibility of synthetic chemistry. Clearly, enabling the evolution of non-natural 
synthetic small molecules and polymers, similarly to the way nature evolves biomolecules, 
would lead to much more effective methods of discovering new synthetic ligands, receptors, and 
catalysts difficult or impossible to generate using rational design. 

Summary OF THE Invention 
[0006] The recognition of the need to be able to ampUfy and evolve classes of molecules 
besides nucleic acids and proteins led to the present invention providing methods and 
compositions for the template-directed synthesis, amplification, and evolution of molecules. In 
general, these methods use an evolvable template to direct the synthesis of a chemical compound 
or library of chemical compounds {i.e., the template actually encodes the synthesis of a chemical 
compound). Based on a library encoded and synthesized using a template such as a nucleic acid, 
methods are provided for amplifying, evolving, and screening the library. In certain 
embodiments of special interest, the chemical compounds are compounds that are not, or do not 
resemble, nucleic acids or analogs thereof. In certain embodiments, the chemical compounds of 
these template-encoded combinatorial Ubraries are polymers and more preferably ai*e unnatural 
polymers (/.e., excluding natural peptides, proteins, and polynucleotides). In other embodiments, 
the chemical compounds are small molecules. 

[0007] In certain embodiments, the method of synthesizing a compound or library of 
compounds comprises first providing one or more nucleic acid templates, which one or more 
nucleic acid templates optionally have a reactive unit associated therewith. The liucleic acid 
template is then contacted with one or more transfer units designed to have a first moiety, an 
anti-codon, which hybridizes to a sequence of the nucleic acid, and is associated with a second 
moiety, a reactive unit, which includes a building block of the compound to be synthesized. 
Once these transfer units have hybridized to the nucleic acid template in a sequence-specific 
manner, the synthesis of the chemical compound can take place due to the interaction of reactive 
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moieties present on the transfer units and/or the nucleic acid template. Signficantly, the 
sequence of the nucleic acid can later be determined to decode the synthetic history of the 
attached compound and thereby its structure. It will be appreciated tliat the method described 
herein may be used to synthesize one molecule at a time or may be used to synthesize thousands 
to millions of compounds using combinatorial methods. 

[0008] It will be appreciated tliat libraries synthesized in this manner (i.e., having been 
encoded by a nucleic acid) have the advantage of being amplifiable and evolvable. Once a 
molecule is identified, its nucleic acid template besides acting as a tag used to identify the 
attached compound can also be amplified using standard DNA techniques such as the 
polymerase chain reaction (PGR). The amplified nucleic acid can then be used to synthesize 
more of the desired compound. In certain embodiments, during the amplification step mutations 
are mtroduced mto the nucleic acid in order to generate a population of chemical compounds that 
are related to the parent compound but are modified at one or more sites. The mutated nucleic 
acids can then be used to synthesize a new library of related compounds. In tliis way, the library 
being screened can be evolved to contain more compounds with the desired activity or to contain 
compounds with a higher degree of activity. 

[0009] The methods of die present invention may be used to synthesize a wide variety of 
chemical compounds. In certain embodiments, the methods are used to synthesize and evolve 
unnatural polymers (ie,, excluding polynucleotides and peptides), which carmot be amplified 
and evolved using standard techniques currently available. In certain other embodiments, the 
mventive methods and compositions are utilized for the synthesis of small molecules that are not 
typically polymeric. In still otlier embodiments, the method is utilized for the g^nerateion of 
non-natural nucleic acid polymers. 

[0010] The present mvention also provides the transfer molecules (e.g., nucleic acid 
templates and/or transfer units) usefid in the practice of the inventive methods. These transfer 
molecules typically include a portion capable of hybridizmg to a sequence of nucleic acid and a 
second portion with monomers, other building blocks, or reactants to be incorporated into the 
final compound being synthesized. It will be appreciated that the two portions of the transfer 
molecule are preferably associated ^yith each other either directly or through a linker moiety. It 
will also be appreciated that the reactive unit arid the anti-codon may be present in the same 
molecule (e.g., a non-natural nucleotide having fimctionality incorporated therein). 
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[0011] The present invention also provides kits and compositions useful in the practice of the 
inventive methods. These kits may include nucleic acid templates, transfer molecules, 
monomers, solvents, buffers, enzymes, reagents for PGR, nucleotides, small molecule scaffolds, 
etc. The lot may be used in the synthesis of a particular type of unnatural polymer or small 
molecule. 

Definitions 

[00121 The term antibody refers to an immimoglobulin, whether natural or wholly or 
partially synthetically produced All derivatives thereof wliich maintain specific binding ability 
are also included in the term. The term also covers any protein having a binding domain which 
is homologous or largely homologous to an unmunoglobulin binding domain. These proteins 
may be derived from natural sources, or partly or wholly synthetically produced. An antibody 
may be monoclonal or polyclonal. The antibody may be a member of any immunoglobulin 
class, including any of the human classes: IgG, IgM, IgA, IgD, and IgE. Derivatives of the IgG 
class, however, are preferred in the present invention. 

[0013] The term, associated with, is used to describe the mteraction between or among two 
or more groups, moieties, compounds, monomers, etc. When two or more entities are 
"associated with" one another as described herein, they are linked by a dkect or indirect covalent 
or non-covalent interaction. Preferably, the association is covalent. TJie covalent association 
may be through an amide, ester, carbon-carbon, disulfide, carbamate, ether, or carbonate linkage. 
The covalent association may also include a linker moiety such as a photocleavable linker. 
Desirable non-^ovalent interactions include hydrogen bonding, van der Waals interactions, 
hydrophobic interactions, magnetic interactions, electrostatic interactions, etc. Also, two or more 
entities or agents may be "associated" with one another by being present together in the same 
compositiorL 

[0014] A biological macromolecule is a polynucleotide (e.g., RNA, DNA, RNATDNA 
hybrid), protein, peptide, lipid, natural product, or polysaccharide. The biological 
imcromolecule may be naturally occurring or non-naturally occurring. In a preferred 
embodiment, a biological macromolecule has a molecular weight greater than 500 g/mol. 
[0015] Polynucleotide, nucleic acid, or oligonucleotide refers to a polymer of nucleotides. 
The polymer may include natural nucleosides (/.e,, adenosine, thymidine, guanosine, cytidine. 
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uridine, deoxyadenosine, deoxytliymidine, deoxyguanosine, and deoxycytidine), nucleoside 
analogs {e,g., 2-aminoadenosine, 2-thiothymidine, inosine, pyrrolo-pyrimidine, 3-meaiyl 
adenosine, 5-methylcytidine, . CS-bromouridine, C5-fluorouridine, C5-iodouridine, 
C5-propynyl-uridine, C5-propynyl-cytidine, C5-methylcytidine, T^ieazaadenosine, 
7-deazaguanosine, 8-oxoadenosine, 8-oxoguanosine, 0(6)-methylguanine, and 2-thiocytidine), 
chemically modified bases, biologically modified bases (e.g., methylated bases), intercalated 
bases, modified sugars (e.g., 2 -fluororibose, ribose, 2'-deoxyribose, arabinose, and hexose), or 
modified phosphate groups (^.g., phosphorothioates and 5' -N-phosphoramidite linkages). 
[0016] A protein comprises a polymer of amino acid residues linked together by peptide 
bonds. The term, as used herein, refers to proteins, polypeptides, and peptide of any size, 
structure, or fimction. Typically, a protein will be at least three amino acids long. A protein may 
refer to an individual protein or a collection of proteins. A protein may refer to a full-length 
protein or a fragment of a protein. Inventive proteins preferably contain only natural amino 
acids, although non-natural amino acids (/.e., compounds that do not occur in nature but that can 
be incorporated into a polypeptide chain; see, for example, 
http://wwwxco.caltech.edu/'-dadgrp/Unnatstructgif, which displays structures of non-natural 
amino acids that have been successfully incorporated into functional ion chamiels) and/or amino 
acid analogs as are known in the art may altematively be employed. Also, one or more of the 
amino acids in an inventive protein may be modified, for example, by the addition of a chemical 
entity such as a carbohydrate group, a hydroxyl group, a phosphate group, a famesyl group, an 
isofamesyl group, a fatty acid group, a linker for conjugation, fiinctionalization, or other 
modification, etc. A protein may also be a single molecule or may be a multi-molecular 
complex. Aproteinmay be just a fragment of a naturally occurring protein or peptide. A protein 
naay be naturally occurring, recombinant, or synthetic, or any combination of these. 
[0017] The term small molecule, as used herein, refers to a non-peptidic, non-oUgomeric 
organic compound either synthesized in the laboratory or found in nature. Small molecules, as 
used herein, can refer to compounds tliat are "natural product-like", however, the tenn "small 
molecule" is not limited to "natural product-like" compounds. Rather, a small molecule is 
typically characterized in that it possesses one or more of the following characteristics including 
having several carbon-carbon bonds, having multiple stereocenters, having multiple functional 
grqups, having at least two dififerent types of functional groups, and having a molecular weight 
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of less than 1500, although this characterization is not intended to be limiting for the purposes of 
the present invention. 

[0018] The term small molecule scaffold, as used herein, refers to a cheinical compound 
having at least one site for fiznctionalization. In a preferred embodiment, the small molecule 
scaffold may have a multitude of sites for functionalization. These fimctionalization sites may 
be protected or masked as would be appreciated by one of skill in this art. The sites may also be 
found on an underlying ring structure or backbone. 

[0019] The term transfer unit, as used herein, refers to a molecvde comprising an anti-codon 
moiety associated with a reactive unit, including, but not limited to a building block, monomer, 
monomer unit, or reactant used in synthesizing the nucleic acid-encoded molecules. 

Description of the Figures 
[0020] Figure 1 depicts nature's approach (gray) and the classical chemical approach (black) 
to generating molecular function. 

[0021] Figure 2 depicts certain DNA-templated reactions for nucleic acids and analogs 
thereof. 

[0022] Figure 3 depicts the general method for synthesizing a polymer using nucleic acid- 
templated synthesis. 

[0023] Figure 4 shows a quadruplet and triplet non-frameshifting codon set. Each set 
provides nine possible codons. 

[0024] Figure 5 shows methods of screening a library for bond-cleavage and bond-formation 
catalysts. These methods take advantage of streptavidin's natural affinity for biotin. 
[0025] Figure 6A depicts the synthesis directed by hairpin (H) and end-of-helix (E) DNA 
templates. Reactions were analyzed by denaturing PAGE after the indicated reaction times. 
Lanes 3 and 4 contained templates quenched with excess P-mercaptoethanol prior to reaction. . 
[0026] Figure 6B depicts matched (M) or mismatched (X) reagents linked to thiols (S) or 
primary anwnes (N) were mixed with 1 equiv of template fiinctionalized with the variety of 
electrophiles shown. Reactions with thiol reagents were conducted at pH 7,5 under the following 
conditions: SIAB and SBAP: 37°C, 16 h; SIA: 25°Q 16 h, SMCC, GMBS, BMPS, SVSB: 
25°C, 10 min. Reactions witli amine.reagents were conducted at pH 8.5 for 75 minutes. 
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[0027] Figure 7 depicts (a) H templates linked to a-iodoacetamide group v^ch were reacted 
with thiol reagents containing 0, 1, or 3 mismatches at 25°C. (b) Reactions in (a) were repeated 
at the indicated temperature for 16 h. Calculated reagent Tm: 38°C (matched), 28°C (single 
mismatch). 

[0028] Figure 8 depicts a reaction performed using a 41-base E template and a 10-base 
reagent designed to anneal 1-30 bases from the 5' end of tlae template. The kinetic profiles in the 
graph show the average of two trials (deviations < 10%). The "n = 1 mis" reagent contams three 
mismatches. 

[0029] Figure 9 depicts the repeated n = 10 reaction in Figure 8 in which the nine bases 
followmg the 5'-NH2-dT were replaced with the backbone analogues shown. Five equivalents 
of a DNA oligonucleotide complementary to the intervening bases Were added to the "DNA + 
clamp" reaction. Reagents were matched (0) or contained three mismatches (3). The gel shows 
reactions at 25°C after 25 min. 

[0030] Figure 10 depicts the n = 1, n = 10, and n = 1 mismatched (mis) reactions described 
in Figure 8 which were repeated witli template and reagent concentrations of 12.5, 25, 62.5 or 
125 nM. 

[0031] Figure . 11 depicts a model translation, selection and amplification of synthetic 
molecules that buid streptavidin from a DNA-encoded Ubrary. 

[0032] Figure 12 depicts (a) Lanes 1 and 5: PCt: amplified library before streptavidin 
binding selection. Lanes 2 and 6: PGR amplified library after selection. Lanes 3 and 7: PGR 
amplified authentic biotin-encoding template. Lane 4: 20 bp ladder. Lanes 5-7 were digested 
with Tsp45I, . DNA sequencing traces of the amplified templates before and after selection are 
also shown, together with the sequences of the non-biotin encoding and biotin-encoding 
templates, (b) General scheme for the creation and evolution of libraries of non-natural 
molecules using DNA-templated synthesis, where -Ri represents the library of product 
functionality transferred from reagent library I and -Rib represents a selected product. 
[0033] Figure 13 depicts exemplary DNA-templated reactions. For all reactions under the 
specified conditions, product yields of reactions with matched template and reagent sequences 
were greater than 2()-fold higher thain that of control reactions with scrambled reagent sequences. 
Reactions were conducted at 25°C witli one equivalent each of template and reagent at 60 nM 
final concentration unless otherwise specified. Conditions: a) 3 mM NaBHsCN, 0.1 M MES 
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buffer pH 6.0, 0.5 MNaCl, 1.5 h; b) 0.1 M TAPS buffer pH 8.5, 300 mMNaCl, 12 h; c) 0.1 M 
pH 8.0 TAiPS buffer, 1 M NaCl, 5°C, 1.5 h; d) 50 mM MOPS buffer pH 7.5, 2.8 M NaCl, 22 h; 
e) 120 nM 19, 1.4 mM NaiPdCU, 0.5 MNaOAc buffer pH 5.0, 18 h; f) Preniix Na2PdCl4 with 
two equivalents of P(p-S03C6H4)3 in water 15 min., then add to reactaiits in 0.5 M NaOAc buffer 
pH 5.0, 75 toM NaCl, 2 h (j5nal [Pd] = 0.3 naM, [19] = 120 nM). The olefm geometry of 
products from 13 and the regiochemistries of cycloaddition products from 14 and 16 are 
presumed but not verified. 

[0034] Figure 14 depicts analysis by denaturing polyacrylamide gel electrophoresis of 
representative DNA-templated reactions listed in Figures 13 and 15. The structures of reagents 
and templates correspond to the numbering in Figures 13 and 15. Lanes 1, 3, 5, 7, 9, 11: 
reaction of matched (complementary) reagents and templates under conditions listed in Figures 
13 and 15 (the reaction of 4 and 6 was mediated by DMT-MM). Lanes 2, 4, 6, 8, 10, 12: 
reaction of mismatched (non-complementary) reagents and templates under conditions identical 
to those in lanes 1, 3, 5^ 7, 9 and 1 1, respectively. 

[0035] Figure 15 depicts DNA-templated amide bond formation mediated by EDC and 
sulfo-NHS or by DMT-MM for a variety of substituted carboxylic acids and amines. In each 
row, yields of DMT-MM-mediated reactions between reagents and templates complementary in 
sequence are followed by yields ofEDC and sulfo-NHS-mediated reactions. Conditions: 60 nM 
template, 120 nM reagent, 50 mM DMT-MM in 0.1 .M MOPS buffer pH 7,0, 1 M NaCl, 16 h, 
25°C; or 60 nM template, 120 nM reagent, 20 mM EDC, 15 mM sulfo-NHS, 0,1 M MES buffer 
pH 6.0, 1 M NaCl, 16 h, 25°C. In all cases, control reactions with mismatched reagent 
sequences yielded little or no detectable product. 

[0036] Figure 16 depicts (a) Conceptual model for distance-independent DNA-templated 
synthesis. As flie distance between the reactive groups of an armealed reagent and template (n) is 
increased, the rate of bond formation is presumed to decrease. For those values of n in which the 
rate of bond formation is significantly higher than the rate of template-reagent annealing, the rate 
of product formation remains constant In this regime, the DNA-templated reaction shows 
distance independence, (b) Denaturmg polyacrylamide gel electrophoresis of a DNA-templated 
Wittig olefination between complementary II and 13 with either zero bases (lanes 1-3) or ten 
bases (lanes 4-6) separating annealed reactants. Although the apparent second order rate 
constants for the n = 0 and n = 10 reactions differ by three-fold (kapp (n = 0) = 9.9 x IO3 M"*s"^ 
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while kapp (n = 10) = 3.5 x 10"^ IvT^s*^), product yields after 13 h at both distances are nearly 
quantitative. Control reactions containing sequence mismatches yielded no detectable product 
(not shown). 

(0037] Figure 17 depicts certain exemplary DNA-templated complexity building reactions. 
[0038] Figure 18 depicts certain exemplary linkers for use in the method of the invention. 
[0039] Figure 19 depicts certain additional exemplary linkers for use in the method of the 
invention. 

[0040] Figure 20 depicts an -exemplary tliioester linker for lise in the method of the 
invention. 

[0041] Figure 21 depicts DNA-templated amide bond formation reactions in which reagents 
and templates are complexed with dimethyldidodecylammonium cations. 
[0042] Figure 22 depicts the assembly of transfer units along the nucleic acid template and 
polymerization of the nucleotide anti-codon moieties. 

[0043] Figure 23 depicts the polymerization of the dicarbamate units along the nucleic acid 
template to form a polycarbamate. To mitiate polymerization the "start" monomer ending in a o- 
nitrobenzylcarbamate is photodeprotected to reveal the primary amine that initiates carbamate 
polymerization. Polymerization then proceeds in the 5' to 3' direction along the DNA backbone, 
with each nucleophilic attack resultuig in the subsequent unmasking of a new amine nucleophile. 
Attack of the "stop" monomer liberates an acetamide rather than an amine, thereby terminating 
polymerization. 

[0044] Figure 24 depicts cleavage of the polycarbamate from the nucleotide backbone. 
Desilylation of the enol ether linker attaching the anti-codon moiety to the monomer unit and tlie 
elimination of phosphate driven by the resulting release of phenol provides the provides the 
polycarbamate covalently linked at its carboxy terminus to its encoding single-stranded DNA. 
[0045] Figure 25 depicts components of an amplifiable, evoivable furictionalized peptide 
nucleic acid library. 

[0046] Figure 26 depicts test reagents used to optimize reagents and conditions for DNA- 
templated PNA coupling. 

[0047] Figure 27 depicts a simple set of PNA monomers derived from commercially 
available building blocks useful for evolving a PNA-based fluorescent Ni^"*" sensor. 
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[0048] Figure 28 depicts two schemes for the selection of a biotin-terminated functionalized 
PNA capable of catalyzing an aldol or retroaldol reaction. 

[0049] Figure 29 depicts DNA-template-directed synthesis of a combinatorial small 
molecule library. 

[0050] Figure 30 shows schematically how DNA-linked small molecule scaffolds can be 
functionalized sequence-specfiically by reaction with synthetic reagents linked to 
complementary nucleic acid oligonucleotides, this process can be repeated to complete the 
synthetic transformations leading to a fully functionalized molecule. 

[0051] Figure 31 shows the fimctionalization of a cephalosporin small molecule scaffold 
with various reactants. 

[0052] Figure 32 depicts a way of measuring the rate of reaction between a fixed 
nucleophile and an electrophile hybridized at varying distances along a nucleic acid template to 
define an essential reaction window in which nucleic acid-templated synthesis of nonpolymeric 
structures can take place. 

[0053] Figure 33 depicts tliree linker strategies for DNA-templated synthesis. In the 
autocleaving linker strategy, the bond connecting the product from the reagent oligonucleotide is 
cleaved as a natural consequence of the reaction. In tlae scarless and useflil scar linker strategies, 
this bond is cleaved following the DNA-templated reaction. The depicted reactions were 
analyzed by denaturing polyacrylamide gel electrophoresis (below). Lanes 1-3 were visualized 
using UV light without DNA staining; lanes 4-10 were visualized by staining witli ethidium 
bromide following by UV transillumination. Conditions: 1 to 3: one equivalent each of reagent 
and template, 0,1 M TAPS buffer pH 8.5, 1 M NaCl, 25 °C, 1.5 h; 4 to 6: three equivalients of 4, 
.0.1 M MES buffer pH 7.0, 1 M NaN02, 10 mM AgNOa, 37 ^C, 8 h; 8 to 9: 0.1 M CAPS buffer 
pH 11.8, 60 mM BME, 37 °C, 2 h; 11 to 12: 50 mM aqueous NaI04, 25 °C, 2 h. Ri = 
NH(CH2)2NH-dansyl; R2 = biotin. 

[0054] Figure 34 depicts strategies for purifying products of DNA-templated synthesis. 
Using biotinylated reagent oligonucleotides, products arising from an autocleaving linker are 
partially purified by washing the crude reaction with avidin-linked beads (top). Products 
generated firom DNA-terapIated reactions using the scarless or useful scar linkers can be purified 
hy using biotinylated reagent oligonucleotides, capturing crude reaction products with avidin- 
linked beads, and eluting desired products by inducing linker cleavage (bottom). 
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[0055] Figure 35 depicts the generation of an initial template pool for an exemplary library 
synthesis. 

[0056] Figure 36 depicts the DNA-templated synthesis of a non-natural peptide library. 
[0057] Figure 37 depicts a 5 '-reagent DNA-linker-amino acid^ 

[0058] Figure 38 depicts the DNA-templated synthesis of an evolvable diversity oriented 
bicycliclibrary. 

[0059] Figure 39 depicts DNA-templated multi-step tripeptide synthesis. Each DNA- 
templated amide formation used reagents containing the sulfone linker described in the text 
Conditions: step. 1: activate two equivalents 13 in 20 mM EDC, 15 mM sulfo-NHS, 0.1 M MES 
bujafer pH 5.5, 1 M NaCl, 10 min, 25 X, then add to template in 0.1 M MOPS pH 7.5, IM NaCl, 
25*^0, 1 h; steps 2 and 3: two equivalents of reagent, 50 mM DMT-MM, 0.1 M MOPS buffer pH 
7.0, 1 M NaCl, 6 h, 25 °C. Desired product after each step was purified by capturing on avidin- 
linked beads and eiuting with 0.1 M CAPS buffer pH 11.8, 60 mM BME, 37 °C, 2 h. The 
progress of each reaction and purification was followed by denaturing polyacrylamide gel 
electrophoresis (bottom). Lanes 3, 6, and 9: control reactions using reagents containing 
scrambled oligonucleotide sequences. 

[0060] Figure 40 depicts Nourpeptidic DNA-templated multi-step synthesis. The reagent 
linkers used in steps 1, 2, and 3 were the diol linker, autocleaving Wittig linker, and sulfone 
linker, respectively; see Figure 1 for linker cleavage conditions. Conditions: 17 to 18: activate 
two equivalents 17 in 20 mM EDC, 15 mM sulfo-NHS, 0.1 M MES buffer pH 5.5, 1 M NaCl, 10 
min, 25 °C, then add to template in 0.1 M MOPS pH 7.5, IM NaCl, 16^C, 8 h; 19 to 21: three 
equivalents 20, 0.1 M TAPS pH 9.0, 3 M NaCl, 48 h, 25 °C; 22 to 23: three equivalents 22, 0.1 
M TAPS pH 8.5, 1 M NaCl, 21 li, 25**C. The progress of each reaction and purification was 
followed by denaturing polyacrylamide gel electrophoresis (bottom). Lanes 3, 6, and 9: control 
reactions using reagents containing scrambled oligonucleotide sequences. 
[0061] Figure 41 depicts the use of nucleic acids to direct the synthesis of new polymers and 
plastics by attaching the nucleic acid to the ligand of a polymerization catalyst The nucleic acid 
can fold into a complex structure which can affect the selectivity and activity of the catalyst 
[0062] . Figure 42 depicts the use of Grubbs' ring-opening metathesis polymerization 
catalysis in evolving plastics. The synthetic scheme of a dihydroimidazole ligand attached to 
DNA is shown as well as the monomer to be used in the polymerization reactioit 
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[0063] Figure 43 depicts the evolution of plastics through iterative cycles of ligand 
diversification, selection and amplification to create polymers with desired properties, 
[0064] Figure 44 depicts an exemplary scheme for the synthesis, in vitro selection and 
amplification of a library of compounds. 

[0065] Figure 45 depicts exemplary templates for use in. recombination. 

[0066] Figure 46 depicts several exemplary deoxyribunucleotides and ribonucleotides 

bearing modifications to groups that do not participate in Watson-Crick hydrogen bonding and 

are known to be inserted with high sequence fidelity opposite natural DNA templates. 

[0067] Figure 47 depicts exemplary metal binding uridine and 7-deazaadenosine analogs. 

[0068] Figure 48 depicts the syntiiesis of analog (7). 

[0069] Figure 49 depicts the synthesis of analog (30). 

[0070] Figure 50 depicts the synthesis of 8-modified deoxyadenosine triphosphates. 

[0071] ' Figure 51 depicts the results of an assay evaluating the acceptance of modified 

nuceotides by DNA polymerases. 

[0072] Figure 52 depicts the syntiiesis of 7-deazaadenosine derivatives. 

[0073] Figure 53 depicts certain exemplary nucleotide triphosphates. 

[0074] Figure 54 depicts a general method for the generation of libraries of metal-binding 

polymers, 

[0075] Figures 55 and 56 depict exemplary schemes for the in vitro selections for non- 
natural polymer catalysts^ 

[0076] Figure 57 depicts an exemplary scheme for the in vitro selection of catalysts for Heck 
reactions, hetero Diels-Alder reactions and aldol additions. 

Description of Certain Embodiments of the Invention 
[0077] As discussed above, it would be desirable to be able to evolve and amplify chemical 
compounds including, but not limited to small molecules and polymers, in the same way that 
biopolymers such as polynucleotides and proteins can be ainplified and evolved. It has been 
demonstrated that DNA-templated synthesis provides a possible means of translating the 
informatipn m a sequence of DNA iiito a synthetic small molecule. In general, DNA templates 
linked to one reactant may be able to recruit a second reactive; group linked to a complementary 
DNA molecule to yield a product. Since DNA hybridization is sequence-specific, the result of a 
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DNA-templated reaction is the translation of a specific DNA sequence into a corresponding 
reaction product. As shown in Figure 2, tlie ability of single-stranded nucleic acid teniplates to 
catalyze the sequence-specific oligomerization of complementary oligonucleotides (T. Inoue ei 
Ml 1 Am, Chem. Soc. 1981, 103, 7666; T. Inou etal J. Mol Biol 1984, 178, 669-76) has been 
demonstrated. This discovery was soon followed by findings that DNA or RNA templates could 
catalyze the oligomerization of complementary DNA or RNA mono-, di-, tri-, or 
oligonucleotides (T, Inoue et al 1 Ant Chem. Soc. 1981, 103, 7666; L. E. Orgel et al Acc. 
Chem. Res. 1995, 28, 109-118; H. Rembold et al J. Mol Evol 1994, 38, 205; L. Rodriguez et al 
J. Mol Evol 1991, 33, 477; C. B. Chen etal J. Mol Biol 1985, 181, 271). DNA or RNA 
templates have since been shown to accelerate the formation of a variety of non-natural nucleic 
acid analogs, including peptide nucleic acids (C. Boliler et al Nature 1995, 376, 578), 
phosphorothioate- (M. K. Herrleih et al J. Ant Cheiit Soc. 1995, 117, 10151-10152), 
phosphoroselenate- (Y. Xu et al J. Am. Chem. Soc. 2000, 122, 9040-9041; Y. Xu et al Nat. 
Biotechnol 2001, 19, 148-152) and phosphoramidate-(A. Luther etal Nature 1998, 396, 245-8) 
containing nucleic acids, non-ribose nucleic acids (M. Bolli etal Ghent Biol 1997, 4, 309-20), 
and DNA analogs in which a phosphate linkage has been replaced with an aminoethyl group (Y. 
Gat et al Biopolymers 1998, 48, 19-28). Nucleic acid templates can also catalyze amine 
acylation between nucleotide analogs (R. K. Bruick et al Chem. Biol 1996, J, 49-56).. 
[0078] However, although the ability of nucleic acid templates to accelerate the formation of 
a variety of non-natural nucleic acid analogues has been demonstrated, nearly all of these 
reactions previously shown to be catalyzed by nucleic acid templates were designed to proceed 
flirough transition states closely resembling the structure of the natural nucleic acid backbone 
(Fig. 2), typically affording products that preserve the same six-bond backbone spacing between 
nucleotide units. The motivation behind this design was prestimably the assumption that the rate 
eidiancement provided by nucleic acid templates depends on a precise alignment, of reactive 
groups, and the precision of this alignment is maximized when the reactants and products mimic 
the structure of the DNA and RNA backbones. Evidence in support of the hypothesis that DNA- 
templated synthesis can only generate products that resemble the nucleic acid backbone comes 
from the well-known diflBculty of macrocycUzation in organic synthesis (G. lUuminati et al. Acc. 
Chem. Res. mi, 14, 95-102; R. B. Woodward etal J. Am. Chem. Soc. 1981, 103, 3210-3213). 
The rate enhancement of intramolecular ring closing reactions compared with their 
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intennolecular counterparts is known to diminish quickly as rotatable bonds are added betWefen 
reactive groups, such that linking reactants with a flexible 14-carbon linker hardly affords any 
rate acceleration (G. lUuminati et al Acc. Chem. Res. 1981, 14, 95-102). 
[0079] Because synthetic molecules of interest do not in general resemble nucleic acid 
backbones, the use of DNA-templated synthesis to translate DNA sequences into synthetic small 
molecules would be broadly useful only if synthetic molecules other than nucleic acids and 
nucleic acid analogs coiild be synthesized in a DNA-templated fashion. The ability of DNA- 
templated synthesis to translate DNA sequences into arbitrary non-natural small molecules 
therefore requires demonstrating that DNA-templated synthesis is a much more general 
phenomenon than has been previously described. 

[0080] Signficaiidy, for the first time it lias been demonstrated herein that DNA-templated 
synthesis is indeed a general phenomenon and can be used for a variety of reactions and 
conditions to generate a diverse range of compounds, specifically including for the first time,, 
compounds that are not, or do not resemble, nucleic acids or analogs thereof. More specifically, 
the present invention extends the ability to amplify and evolve libraries of chemical compoimds 
beyond natural biopolymers. The ability to synthesize chemical compounds of arbitrary 
structure allows researchers to write their own genetic codes incorporating a wide range of 
chemical functionality into novel backbone and side-chain structures, which enables the 
development of novel catalysts, drugs, and polymers, to name a few examples. For example, the 
ability to directly amplify and evolve these molecules by genetic selection enables the discovery 
of entirely new families of artificial catalysts which possess activity, bioavailability, solvent, or 
thermal stability, or other physical properties (such as fluorescence, spin-labeling, or 
photolability) which are difficult or impossible to achieve using the limited set of natural protein 
and nucleic acid building blocks. Similarly, developing methods to amplify and directly evolve 
synthetic small molecules by iterated cycles of mutation and selection enables the isolation of 
novel ligands or drugs with properties superior to those isolated by traditional rational design or 
combinatorial screenmg drug discovery methods. Additionally, extending the approaches 
described herein to polymers of sigrdficance in material science would enable the evolution of 
new plastics. 

[0081] In general, the method of the invention involves 1) providing one or more nucleic 
acid templates, which one or more nucleic acid templates optionally have a reactive unit 
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associated therewith; and 2) contacting the one or more nucleic acid templates with one or more 
transfer units designed to.have a first moiety, an anti-codon which hybridizes to a sequence of 
the nucleic acid, and'is associated with a second moiety, a reactive unit, which includes specific 
functionality, a building block, reactant, etc. for the compound to be syflthesi2»d. It will be 
appreciated that m certain embodiments of the invention, the transfer unit comprises one moiety 
incorporating the hybridization capablility of the anti-codon unit and the cheniical fimctionality 
of the reaction unit. Once these transfer units have hybridized to the nucleic acid template in a 
sequence-specific manner, the synthesis of the cheinical compound can take place due to the 
interaction of reactive units present on the transfer units and/or the nucleic acid template. 
Significantly, the sequence of the nucleic acid can later be determined to decode the synthetic 
history of the attached compound and thereby its structure. It will be appreciated that the method 
described herein may be used to synthesize one molecule, at a time or may be used to synthesize 
thousands to millions of compounds using combinatorial methods. 

[0082] It will be appreciated that a variety of chemical compounds can be prepared and 
evolved according to the metliod of the invention. In certain embodiments of the invention, 
however, the methods are utilized for the synthesis of chemical compounds that are not, or do 
not, resemble nucleic acids or nucleic acid analogs. For example, in certain embodiments of the 
invention, small molecule compounds can be syntheiszed by providing a template which has a 
reactive unit (e.g., building block or small molecule scaffold) associated therewith (attached 
directly or through a linker as described in more detail in Examples 5 herein), and contacting the 
template simultaneously or sequentially with one or more transfer units having one or more 
reactive units associated therewith. In certain other embodiments,*non-natiiral polymers can be 
synthesized by providing a template and contactuig the template simultaneously with one or 
more transfer units having one or more reactive units associated therewith under conditions 
suitable to effect reaction of the adjacent reactive units on each of the transfer units (see, for 
example. Figure 3, and examples 5 and 9, as described in more detail herein). 
[0083] Certain embodiments are discussed m more detail below; however, it will be 
appreciated that the present invention is not intended' to be limited to those embodiments 
discussed below. Rather, the present invention is intended to encompass these embodiments and 
equivalents thereof 
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templates 

[0084] As discussed above, one or more templates are utilized in the method of the invention 
and hybridize to the transfer units to direct the synthesis of the chemical compound. As would 
be appreciated by one of skill in this art, any template may be used in the methods and 
compositions of the present invention. Templates wliich can be mutated and thereby evolved can 
be used to guide the synthesis of another chemical compound or library of chemical compounds 
as described in the present invention. As described in more detail herein, the evolvable template 
encodes the synthesis of a chemical compound and can be used later to decode the synthetic 
history of the chemical compound, to indirectly amplify the chemical compound, and/or to 
evolve {f.e., diversify, select, and amplify) the chemical compound. The evolvable template is, 
in certain embodiments, a nucleic acid. In certain embodiment of the present invention, the 
template is based on a nucleic acid. , 
[0085] The nucleic acid templates used in the present invention are made of DNA, RNA, a 
hybrid of DNA and RNA, or a derivative of DNA and RNA, and may be skigle- or double- 
stranded. The sequence of the template is used in the inventive method to encode the synthesis 
of a chemical compound, preferably a compound that is not, or does not resemble, a nucleic acid 
or nucleic acid analog (e.g., an unnatural polymer or a small molecule). In the case of certain 
mmatural polymers, the nucleic acid template is used to align the monomer units in the sequence 
they will appear in the polymer and to bring them in close proximity with adjacent monomer 
units aloiig the template so that they will react and become joined by a covalent bond. In the 
case of a small molecule, the template is used to bring particular reactants within proximity of 
the small molecule scaffold in order that they may modify the scaffold in a particular way. In 
certain other embodunents, the template can be utilized to generate non-natural polymers by 
PGR amplification of. a synthetic DNA template library consisting of a random region of 
nucleotides, as describe in Example 9 herein. 

[0086] As would be appreciated by one of skill in the art, the sequence of tlie template may 
be designed in a number of ways without going beyond the scope of the present invention. For 
example, the length of the codon must be detemimed and the codon sequences must be set. If a 
codon length of two is used, then using the four naturally occurring bases only 16 possible 
combinations are available to be used in encoding the .library. If the length of the codon is 
increased to three (the number Nature uses in encoding proteuas), the number of possible 
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combinations is increased to 64. Other factors to be considered in determining the length of the 
codon are mismatching, frame-shifting, complexity of Ubrary, etc. As the length of the codon is 
increased up to a certain extent the number of mismatches is decreased; however, excessively 
long codons will hybridize despite mismatched base pairs. In certam embodiments of special 
interest, the leagth of the codon ranges between 2 and 10 bases. 

[0087] Another problem associated with using a nucleic acid template is frame shifting. In 
Nature, the problem of frame-shifting in the translation of protein from an mRNA is avoided by 
use of the complex machinery of the ribosome. The inventive methods, however, will not take 
advantage of such complex machinery. Instead, frameshifting may be remedied by lengthening 
each codon such that hybridization of a codon out of frame will guarantee a mismatch. For 
example, each codon may start with a G, and subsequent positions may be restricted to T, C, and 
A (Figure 4). In another example, each codon may begin and end with a G, and subsequent 
positions may be restricted to T, C, and A. Another way of avoiding frame shifting is to have the 
codons sufficiently long so that the sequence of the codon is only found within the sequence of 
the template "in frame". Spacer sequences may also be placed in between the codons to prevent 
frameshifting. 

[0088] ' Itwillbeappreciatedttiatthetemplatecanvary greatly in the number of bases. For 
example, in certain embodiments, the template may be 10 to 10,000 bases long, preferably 
between 10 and 1,000 bases long. Tlie length of tlie template will of course depend on the length 
of the codons, complexity of the library, length of the unnatural polymer to be synthesized, 
complexity of the small molecule to be synthesized, use of space sequences, etc. The nucleic 
acid sequence may be prepared using any method known in tlie art to prepare nucleic acid 
sequences. These methods include both in vivo and w vitro methods including PGR, plasmid 
preparation, endonuclease digestion, solid phase synthesis, in vitro transcription, strand 
separation, etc. In certain embodiments, the nucleic acid template is synthesized using an 
automated DNA synthesizer. 

[0089] As discussed above, in certain embodiments of the invention, the method is used to 
synthesize chemical compounds that are not, or do not resemble, nucleic acids or nucleic acid 
analogs. Although it has been demonstrated that DNA-templat^ synthesis can be utilized to 
direct the synthesis of nucleic acids and analogs thereof, it has not been previously demonstrated 
that the phenomenon of DNA-tempalted synthesis is general enough to extend to other more 
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complex chemical compounds (e.g., small molecules, non-natural polymers). As described in 
detail herein, it has been demonstrated that DNA-templated synthesis is indeed a more general 
phenomenon and that a variety of reactions can be utilized. 

[0090] Thus, in certain embodiments of the present invention, the nucleic acid template 
comprises sequences of bases that encode the synthesis of an unnatural polymer or small 
molecule. The message encoded in tlie nucleic acid template preferably begins with a specific 
codon ,that bring into place a chemically reactive site from which the polymerization can take, 
place, or in the case of synthesizing a small molecule the "start" codon may encode for an anti- 
codon associated with a small molecule scafifold or a first reactant. The "start" codon of the 
present invention is analogous to the "start" codon, ATG, found in Nature, which encodes for the 
amino acid methionine. To give but one example for use in synthesizing an unnatural polymer 
library, the start codon may encode for a start monomer unit comprising a primary amine masked 
by a photolabile protecting group, as shown below in Example SA. 

[0091] In yet other embodiments of the invention, the nucleic acid template itself may be 
modified to include an initiation site for polymer synthesis (e.g., a nucleophile) or a small 
molecule scaffold. In certain embodiments, the nucleic acid template includes a hairpin loop on 
one of its ends terminating in a reactive group used to initiate polymerization of the monomer 
units.. For example, a DNA template may comprise a haiipin loop terminating in a 5'-andno 
group, which may be protected or not. From the amino group polymerizatioh of the unnatural 
polymer may commence. The reactive amino group can also be used to link a small molecule 
scaffold onto the nucleic acid template in order to synthesize a small molecule hbrary. 
[0092] To terminate the synthesis of the unnatural polymer a "stop" codon should be 
included in the nucleic acid template preferably at the end of the encoding sequence. The "stop" 
codon of the present invention is analogous to the "stop" codons (re., TAA, TAG, TGA) found 
in mRNA transcripts. In Nature, these codons lead to .the tennmation of protein synthesis. In 
certain embodiments, a "stop" codon is chosen that is compatible with the artificial genetic code 
used to encode the unnatural polymer. For example, the "stop" codon should not conflict with 
any other codons used to encode the synthesis, and it should be of the same general format as the 
other codons used in the template. The "stop" codon may encode for a monomer unit that 
temunates polymerization by not providing a reactive group for fiirther attachment For 
example, a stop monomer unit may contain a blocked reactive group such as an acetamide rather 
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than a primary amine as shown in Example 5A below. In yet other embodiments, the stop 
monomer unit comprises a biotinylated terminus providing a convenient way of terminating the 
polymerization step and purifying the resulting polymer. . 

Transfer Units 

{0093J As described above, in the method of the invention, transfer units are also provided 
which comprise an anti-codon and a reactive unit It ^^all be appreciated that the anti-codons 
used in the present invention are designed to be complementary to the codons present within the 
nucleic acid template, and should be designed with the nucleic acid template and the codons used 
therein in mind. For example, the sequences used in the template as well as the length of the 
codons would need to be taken into account in designing the anti-codons. Any molecule which 
is complementary to a codoh used in the template may be used in the inventive methods (e.g., 
nucleotides or non-natural nucleotides). In certain embodiments,, the codons comprise one or 
more bases found in nature (z.e., thymidine, uracil, guanidine, c)^osine, adenine). In certain 
other embodiments, the anti-codon comprises one or more nucleotides normally found in Nature 
with a base, a sugar, and an optional phosphate group. In yet other embodiments, the bases are 
strung out along a backbone that is not the sugar-phosphate backbone normally found in Nature 
(e.g., non-natural nucleotides). 

[0094] As discussed above, the anti-codon is associated with a particular type of reactive unit 
to form a transfer unit. It will be appreciated that this reactive imit may represent a distinct entity 
or may be part of the functionality of the anti-codon unit (see. Example 9). In certain 
embodiments, each anti-codon sequence is associated with one monomer type. For example, the 
ianti-codon sequence ATTAG may be associated with a carbamate residue with an iso-butyl side 
chain, and the anti-codon sequence CATAG may be associated with a carbamatie residue with a 
phenyl side chain. Tliis one-for-one mapping of anti-codon to monomer units allows one to 
decode any polymer of the library by sequencing the nucleic acid template used in the synthesis 
and allows one to synthesize die same polymer or a related polymer by knowing the sequence of 
the original polymer. It will be appreciated by . one of skill in this art that by changing (e,g., 
mutating) the sequence of the template, different monomer units will be brought into place, 
thereby allowing the synthesis of related polymers, which can subseqiienfly be selected and 
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evolved. In certain preferred embodiments, several anti-codons may code for one monomer unit 
as is the case in Nature. 

[0095] In certain other embodiments of tlie present invention where a small molecule library 
is to be created rather than a polymer library, the anti-codon is associated with a reactant used to 
modify the small molecule scaffold In certain embodiments, the reactant is associated with the 
anti-codon through a luiker long enough to allow the reactant to come in contact with the small 
molecule scaffold. The linker should preferably be of such a length and composition to allow for 
intramolecular reactions and minimize intermolecular reactions. The reactants include a variety 
of reagents as demonstrated by the wide range , of reactions that can be utilized in DNA- 
templated synthesis (see Example 2, 3 and 4 herein) and can be any chemical group, catalyst 
organometallic compounds), or reactive moiety (e;g., electrophiles, nucleophiles) known in 
the chemical arts. 

[0096] Additionally, the association between the anti-codon and the monomer unit or 
reactant in the transfer um't may be covalent or non-covalent. In certain embodiments of special 
mtereste, the association is through a covalent bond, and in certain embodiments the covalent 
linkage is severable. The linkage may be cleaved by. light, oxidation, hydrolysis, exposure to 
acid, exposure to base, reduction, etc. For examples of linkages used in this art, please see 
Fnichtel et al Angew. Chem. Int. Ed. Engl 35:17, 1996, incorporated herein by reference. The 
anti-codon and the monomer unit or reactant may also be associated through non-covalent 
interactions such as ionic, electrostatic, hydrogen bonduig, van der Waals interactions, 
hydrophobic interactions, pi-stacking, etc. and combinations thereof To give but one example, 
the anti-codon may be linked to biotin, and the monomer unit linked to streptavidin. The 
propensity of streptavidin to bind biotin leads to the non-covalent association between the anti- 
codon and the monomer unit to form the transfer unit. 

Synthesis of Certain Exemplary Compounds 

[0097] It will be appreciated that a variety of compounds and/or libraries can be prepared 
using the method of the invention. As discussed above, m certain embodiments of special 
interest, compounds that are not, or do not resemble, nucleic acids or. analogs thereof, are 
synthesized according to the method of the invention. 
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[0098] In certain embodiments, polymers, specifically unnatural polymers, are prepared 
according to the method of the present invention. The unnatural polymers that can be created 
using the inventive method and system include any unnatural polymers. Exemplary uimatura! 
polymers include, but are not limited to, polycarbamates, polyureas, polyesters, polyacrylate, 
polyalkylene (e.g., polyethylene, polypropylene), polycarbonates, polypeptides with unnatural 
stereochemistry,, polypeptides with imnatural amino acids, and combmation thereof. In certain 
embodiments, the polymers comprises at least 10 monomer units. In certain other embodiments, 
the polymers comprise at least 50 monomer units. In yet other embodiments, the polymers 
comprise at least 100 monomer units. The polymers synthesized using the inventive system may 
be used as catalysts, pharmaceuticals, metal chelators, materials, etc, 

[0099] In preparing certain lumatural polymers, the monomer units attached to the anti- 
codons and used in the present invention may be any monomers or oligomers capable of being 
joined together to form a polymer. The monomer imits may be carbamates, D-amino acids, 
unnatural amino acids, ureas, hydroxy acids, esters, carbonates, acrylates, ethers, etc. In certain 
embodiments, the monomer units have two reactive groups used to link the monomer unit into 
the growing polymer chain. Preferably, the two reactive groups are not the same so tliat the 
monomer unit may be incorporated into the polymer in a directional sense, for example, at one 
end may be an electrophile and at the other end a nucleophile. Reactive groups may include, but 
are not limited to, esters, amides, carboxylic acids, activated carbonyl groups, acid chlorides, 
amines, hydroxyl groups, thiols, etc. In certain embodiments, the reactive groups are masked or 
protected (Greene & Wuts Protective Groups in Organic Synthesis, 3rd Edition Wiley, 1999; 
incorporated herein by reference) so that polymerization may not take place until a desired time 
when the reactive groups are deprotected. Once the monomer units are assembled along the 
nucleic acid template, mitiation of the polymerization sequence results in a cascade of 
polymerization and deprotection steps wherein the polymerization step results in deprotection of 
a reactive group to be used in the subsequent polymerization step (see. Figure 3), 
[00100] The monomer units to be polymerized may comprise two or more units depending on 
the geometry along the nucleic acid template. As would be appreciated by one of skill in this art, 
the monomer units to be polymerized must be able to stretch along the nucleic acid template an:d 
particularly across the distance spanned by its encoding anti-codon and optional spacer sequence. 
In certain embodiments^ the monomer unit actually comprises two monomers, for example, a 
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dicarbamate, a diurea, a dipeptide, etc. In yet oUier embodiments, the monomer imit actually 
comprises three or more monomers. 

[00101] The monomer units may contain any chemical groups known in the art. As would be 
appreciated by one of skill in this art, reactive chemical groups especially those that would 
interfere with polymerization, hybridization, eic, are masked using known protecting groups 
(Greene & Wuts Protective Groups in Organic Synthesis, 3rd Edition Wiley, 1999; incorporated 
herein by reference). In general, the protectmg groups xised to mask these reactive groups are 
orthogonal to those used in protecting the groups used in the polymerization steps. 
[00102] In synthesizing an unnatural polymer, in certain embodiments, a template is provided 
encoding the sequence of monomer units. Transfer units are then allow to contact the template 
under conditions tliat allow for hybridization of the anti-codons to the template. Polymerization 
of the monomer units along the template is then allowed to occur to form the unnatural polymer. 
The newly synthesized polymer may then be cleaved from the anti-codons and/or the template. 
The template may be used as a tag to elucidate the structure of the polymer or may be used to 
amplify and evolve the unnatural polymer. As will be described in more detail below, the 
present method may be used to prepare a library of unnatural polymers. For example, in certain 
embodiments, as described in more detail in Example 9 herein, a library of DNA templates can 
be utilized to prepare unnatural polymers. In general, the method takes advantage of the fact that 
certain DNA polymerases are able to accept certain modified nucleotide triphosphate substrates 
and that several deoxyribonucleotides and ribonucleotides bearing modified groups that do not 
participate in Watson-Crick bonding are known to be inserted with high sequence specificity 
opposite natural DNA templates. Accordingly, single stranded DNA containing modified 
nucleotides can serve as eflBcient templates for the DNA-polymerase catalyzed incorporation of 
natural or modified nucleotides. 

[00103] It will be appreciated that the inventive methods may also be used to synthesize other 
classes of chemical compounds besides unnatural polymers. For example, small molecules may 
be prepared using the methods and compositions provided by the present invention. These small 
molecules may be natural product-like, non-polymeric, and/or non-oligomeric. The substantial 
interest in small molecules is due in part to their use as the active ingredient in many 
pharmaceutical preparations although diey may also be used as catalysts, materials, additives, 
etc, ^ " 
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[00104] In synthesizing small molecules using the method of the present invention, an 
evolvable template is also provided. The template may either comprise a small molecule 
scaffold upon which the small molecule is to be built, or a small molecule scaffold may be added 
to the template. The small molecule scaffold may be any chemical compound with sites for 
functionalization. For example, the small molecule scaffold may comprises a ring system (e.g., 
the ABCD steroid ring system found, in cholesterol) with functionalizable groups off the atoms 
making up the rings. In another example, the small molecule may be the underlying structure of 
a pharmaceutical agent such as morphine or a cephalosporin antibiotic (see Examples 5C and 5D 
below below). The sites or groups to be functionalized on the small molecule scaffold may be 
protected using methods and protecting groups known in the art. The protecting groiq)S used in a 
small molecule scaffold may be orthogonal to one another so that protecting groups can be 
removed one at a time. 

[00105] In this embodiment, the transfer units comprise an anti-codon similar to those 
described in the xmnatural polymer synthesis; however, these anti-codons are associated with 
reactants or building blocks to be used in modifying, adding to, or taking away from the- small 
molecule scaffold. The reactants or building blocks may be electrophiles (e.g., acetyl, amides, 
acid chlorides, esters, nitriles, imines), nucleophiles (e.g., anndnes, hydroxyl groups, thiols), 
catalysts (e.g., organometallic catalysts), side chains, etc. See, for example reactions in aqueous 
and organic media as described herein in Examples 2 and 4. The ti;ansfer units are allowed to 
contact the template under hydridizing conditions, and the attached reactant or building block is 
allowed to react with a site on the small molecule scaiFfold, In certaLu embodiments, protecting 
groups on the small molecule template are removed one at a time from the sites to be 
functionalized so that the reactant of the transfer unit will react at only the desired position on the 
scaffold. As will be appreciated by one of skill in the art, the anti-codon may be associated with 
the reactant through a linker moiety (see. Example 3). The linker facilitates contact of the 
reactant with the small molecule scaffold md in certmn embodunents, depending on the desired 
reaction, positions DNA as a leaving group C*autocleavable" strategy), or may link reactive 
groups to the template via the "scarless" linker strategy (v^ch yields product without leaving 
^ behind additional chenlical functionality), or a "useful scar" strategy (in which the linker is left 
behind and can be functionalized in subsequent steps following linker cleavage). The reaction 
condition, linker, reactatit, and site to be functionalized are chosen to avoid intermolecular 
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reactions and accelerate intramolecular reactions. It will also be appreciated that the method of 
the present invention contemplates both sequential and simultaneous contacting of the template 
with transfer units depending on the particular compoimd to be synthesized. In certain 
embodiments of special interest, the multi-step synthesis of chemical compounds is provided in 
w^ch the template is contacted sequentially with two or more transfer units to facilitate multi- 
step synthesis of complex chemical compounds. 

[00106] After the sites on the scaffold have been modified, the newly synthesized small 
molecule is linked to the. template that encoded is synthesis. Decodmg of the template tag will 
allow one to elucidate the synthetic history and tliereby the structure of the small molecule. The 
template may also be amplified in order to create more of the desired small molecule and/or the 
template may be evolved to create related small molecules. The small molecule may also be 
cleaved fi-om the template for purification or screening. 

[00107J As would be appreciated by one of skill m tliis art, a plurality of templates may be 
used to encode the synthesis of a combinatorial library of small molecules using the method 
described above. This would allow for the amplification and evolution of a small molecule 
library, a feat which has not been accomplished before the present invention. 

Methodof Synthesizing Libraries of Compounds 

[00108] In :the inventive method, a nucleic acid template, as described above, is provided to 
direct the synthesis of an unnatural polymer, a small molecule, or any other type of molecule of 
interest In general, a plurality of nucleic acid templates is provided wherein the number of 
dijfferent sequences provided ranges firom 2 to 10^^. In. one embodiment of the present invention, 
a plurality of nucleic acid templates is provided, preferably at least 100 different nucleic acid 
templates, more preferably at least 10000 different nucleic acid templates, and most preferably at 
least 1000000 different nucleic acid templates. Each template provided comprises a unique 
nucleic acid sequence used to encode the synthesis of a particular unnatural polymer or small 
molecule. As described above, the template may also have fiinctionality such as a primary amine 
fiom which the polymerization is initiated or a small molecule scaffold. In certain embodiments, 
tiie nucleic acid templates are provided in one "pot". In certain other embodiments, the 
templates are provided in aqueous media, and subsequent reactions are performed in aqueous 
media. 
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(00109] To the template is added transfer units with anti-codons, as described abdve, 
associated with a monomer unit, as described above. In certain embodiments, a plurality of 
transfer units is provided so that there is an anti-codon for every codon represented in the 
template. In a preferred embodiment, certain anti-codons are used as start and stop sites. In 
general, a large enough number of transfer units is provided so that all corresponding codon sites 
on the template are filled after hybridization, 

[00110] The anti-codons of the transfer units are allowed to hybridize to the nucleic acid 
template thereby bringing the monomer units togetlier in a specific sequence as determined by 
the template. In the situation where a small molecule library is being synthesized, reactarits are 
brought in proximity to a small molecule scaffold. The hybridization conditions, as would be 
appreciated by those of skill in the art, should preferably allow for only perfect matching 
between the codon and its anti-codon. Even single base pair mismatches should be avoided. 
Hybridization conditions may include, but are not limited to; temperature, salt concentration, pH, 
concentration of template, concentration of anti-codons, and solvent. The hybridization 
conditions used in syntliesizmg the library may depend on the length of the codon/anti-codon, 
the sunilarity between the codons present in the templates, the content of G/C versus A/T base 
pairs, etc (for further information regarding hybridization conditions, please see, Molecular 
Cloning: A Laboratory Manual, 2nd Ed., ed. by Sambrook, Fritsch, and Maniatis (Cold Spring 
. Harbor Laboratory Press: 1989); Nucleic Acid Hybridization (B. D. Hames & S. J. Higgins eds. 
1984); the treatise. Methods in Enzymology (Academic Press, Inc., N.Y.); Ausubel et al Current 
Protocols in Molecular Biology (John Wiley & Sons, Inc., New York, 1999); each of which is 
incorporated herein by ireference). 

[00111] After hybridization of the anti-codons to the codons on the template have occuiied, 
the monomer units are then polymerized in the case of the synthesis of imnatural polymers. The 
polymerization of the monomer units may occur spontaneously or may need to be initiated, for 
example, by the deprotection of a reactive groups such as a nucleophile or by providing light of a 
certain wavelength. In certain other embodiments, polymers can be. catalyzed by DNA 
polymerization capable of effecting polymerization of non-natural nucleotides (see, Exaniple 9)- 
The polymerization preferably occurs in one direction along the template with adjacent monomer 
units becoming joined through a covalent linkage. The termination of the polymerization step 
occurs by the addition of a monomer unit that is not capable of being added onto. In the case of 
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the synthesis of small molecules, the reactants are allowed to react with the small molecule 
scaffold. The reactanf may react spontaneously, or protecting groups on the reactant and/or the 
small molecule scaffold may need to be removed. Other reagents acid, base, catalyst, 
hydrogen gas, etc.) may also be needed to effect the reaction (see. Examples 5A-5E). 
[00112] After the unnatural polymers or small molecules have been created v\^ith the aid of the 
nucleic acid template, they may be cleaved from the nucleic acid template and/or anti-codons 
used to synthesize them. In certain embodiments, the polymers or small molecules are assayed 
before being completely detached from the nucleic acid templates that encode them. -Once the 
polymer or small molecule is selected, the sequence of the template or its complement may be 
determined to elucidate the structure of the attached polymer or small molecule. This sequence 
may then be amplified and/or evolved to create new libraries of related polymers or small 
molecules that in turn may be screened and evolved. 

Uses 

[00113] The methods and compositions of the present invention represent a new way to 
generate molecules with desired properties. This approach marries the extremely powerful 
genetic methods, which molecular biologists have taken advantage of for decades, with the 
flexibility and power of organic chemistry. The ability to prepare, amplify, and evolve unnatural 
polymers by genetic selection may lead to new classes of catalysts that possess activity, 
bioavailability, stability, fluorescence, photolability, or other properties that are difficult or 
impossible to achieve using the limited set of building blocks found in proteins and nucleic acids. 
Similarly, developing new systems for preparing, amplifying, and evolving small molecules by 
iterated cycles of mutation and selection may lead to the isolation of novel ligands or drugs with 
properties superior to those isolated by slower traditional drug discovery methods (see. Example 

n 

[00114] Performing organic library synthesis on the molecular biology scale is a 
fundamentally different approach from traditional soUd phase library synthesis and carries 
significant advantages. A library created using the inventive methods can be screened using any 
method known in this art (e.g., binding assay, catalytic assay). For example, selection based on 
binding to a target molecule can be carried out on the entire library by passing the library over a 
resin covalently linked to the target. Those biopolymers that have affinity for the resin-bound 
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target can be eluted with free target molecules, and the selected compounds can be amplified 
using the methods described above. Subsequent rounds of selection and amplification can result 
in a pool of compounds enriched with sequences that bind the target molecule. In certain 
embodiments, the target molecule mimics a transition state of a chemical reaction, and the 
chemical compoimds sefected may serve as a catalyst for the chemical reaction. Because the 
information encoding the synthesis of each molecule is covalently attached to the molecule at 
one end, an entire library , can be screened at once and yet each molecule is selected on an 
individual basis. 

100115] Such a library can also be evolved by introducing mutations at the DNA level using 
error-prone PGR (Cadwell et al PCR Methods Appl 2:28, 1992; incorporated hereui by 
reference) or by subjecting the DNA to in vitro homologous recombination (Steimner Proc, Natl, 
Acad. Sci USA 91:10747, 1994; Stammer Nature 370:389, 1994; each of which is incorporated 
herein by reference), Repeiated cycled of selection, amplification, and mutation may afford 
biopolymers with greatly increased binding affinity for target molecules or with significantly 
improved catalytic properties. The final pool of evolved biopolymers having, the desired 
properties can be sequenced by sequencing the nucleic acid cleaved firom the polymers. The 
nucleic acid-firee polymers can be purified using any method known in the art includiiig HPLC, 
column chromatography, FLPC, etc., and its binding or catalytic properties can be verified in the 
absence of covalently attached nucleic acid. 

[00116] The polymerization of synthetically-generated monomer units mdependent of the 
ribosomal machinery allows the incorporation of an enormous variety of side chains with novel 
chemical, biophysical, or biological properties. Terminating each biopolymer with a biotin side 
chain, for example, allows the fecile purification of only full-length biopolymers which have 
been completely translated by passing the library through an avidin-linked resin. Biotin- 
teiminated biopolymers. can be selected for the dctual catalysis of bond-breaking reactions by 
passing these biopolymers over resin linked through the substrate to avidin OFigure 5). Those 
biopolymers that catalyze substrate cleavage would self-elute firom a column charged with this 
resiiL Similarly, biotin-terminated biopolymers can be selected for the catalysis of bond-forming 
reactions (Figure 5). One substrate is linked to resin and the second substrate is linked to avidiiL 
Biopolymers that catalyze bond formation between the substrates are selected by their ability to 
ligate the substrates together, resulting in attachment of the biopolymer to the resin. Novel side 
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chains can also be used to introduce cofactor into the biopolymers. A side chain containing a 
metal chelator, for example, may provide biopolymers with metal-mediated catalytic properties, 
while a flavin-containing side chain may equip biopolymers with the potential to catalyze redox 
reactions. 

[00117] In this manner uimatural biopolymers may be isolated which serve as artificial 
receptors to selectively bind molecules or which catalyze chemical reactions. Characterization 
of these molecules would provide important insight into tlie ability of polycarbamates, polyureas, 
polyesters, polycarbonates, polypeptides with unnatural side chain and stereochemistries, or 
other unnatural polymers to form secondary or tertiary structures with binding or catalj'tic 
properties. 

Kits 

[00118] The present invention also provides kits and compositions for use in the inventive 
methods. The kits may contain any item or composition useful m practicing the present 
invention. The kits may include, but is not limited to, templates, anticodons, transfer units, 
monomer units, building blocks, reactants, small. molecule scaffolds, buffers, solvents, enzymes 
(e.g., heat stable polymerase, reverse transcriptase, ligase, restriction endonuclease, exonuclease, 
Klenow fmgment, polymerase, alkaline phosphatase, polynucleotide kinase), linkers, protecting 
groups, polynucleotides, nucleosides, nucleotides, salts, acids, bases, solid supports, or any 
combinations thereof. 

[00119] As would be appreciated by one of skill in this art, a kit for preparing uimatural 
polymers would contam items needed to prepare imnatural polymers using the inventive methods 
described herein. Such a kit may include templates, anti-codons, transfer units, monomers units, 
or combinations thereof. A kit for synthesizing small molecules may include templates, anti- 
codons, transfer units, buildiilg blocks, small molecule scaffolds, or combinations thereof. 
[00120] The inventive kit may also be equipped ^th items needed to amplify and/or evolve a 
polynucleotide template such as a heat stable polymerase for PGR, nucleotides, buffer, and 
primers. In certain other embodiments, the inventive kit includes items commonly used in 
performing DNA shuffling such as polynucleotides, ligase, and nucleotides. 
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[00121] In addition to the templates and transfer units described herein, the present invention 
also includes compositions comprising complex small molecules, scaffolds, or unnatural polymer 
prepared by any one or more of the methods of the invention as described herein. 

Equivalents 

The represientative examples that follow are intended to help illustrate the invention, and 
are not intended to, nor should they be construed to, limit the scope of the invention. Indeed, 
various modifications of the invention and many fiirther embodiments thereof, in addition to 
those shown and described herein, will become apparent to those skilled in the art from the full 
contents of this document, including the examples which follow and the references to the 
scientific and patent literature cited herein. It should fiirther be appreciated that the contents of 
those cited references are incorporated herein by reference to help illustrate the state of the art. 

The following examples contain important additional information, exemplification and 
guidance that can be adapted to the practice of this invention in its various embodiments and the 
equivalents thereof. 

Exemplification 

[001221 iSxample 1: The Generality of DNA-Templated Synthesis : Clearly, implementing 
the small molecule evolution approach described above requires establishing the generality of 
DNA-templated synthesis. The present invention, for the first time, establishes the generality fo 
this approach and thus enables the. syntheis of a vareity of chemical compounds using DNA- 
templated synthesis. As shown in Figure 6a, the ability of two DNA architectures to support 
solution-phase DNA-templated synthesis was established. Both hairpin (H) and end-of-helix (E) 
templates bearing electrophilic maleimide groups reacted efficiently with one equivalent of thiol 
reagent Imked to a complementary DNA oligonucleotide to yield the thioeflier product in 
minutes at 25 ''C. DNA-templated reaction rates (/:app = -10^ NT^s' ■) were similar for H and E 
. architectures despite significant differences in the relative orientation of their reactive groups. In 
contrast, no product was observed when using reagents containing sequence mismatches, or 
when using templates pre-quenched with excess P-mercaptoethanoI (Fig. 6a). Both templates 
therefore support the sequence-specific DNA-templated addition of a thiol to a maleimide even 
though the structures of tlie resulting products differ markedly fi-om the structure of die natural 
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DNA backbone. Little or no non-templated intermoleciilar reaction products are observed under 
the reaction conditions (pH 7,5, 25 °C, 250 mM NaCl, 60 nM template and reagent). 
[00123] Additionally, sequence-specific DNA-templated reactions spanning a variety of 
reaction types (Sn2 substitutions, additions to a,|3-unsaturated carbonyl systems, and additions to 
vinyl sulfones), nucleophiles (thiols and amines), and reactant structures all proceeded in good 
yields with excellent sequence selectivity (Fig. 6b): Expected product masses were verified by 
mass spectronietry. In each case, matched but not mismatched reagents afforded product 
efi5ciently despite considerable variations in their transition state geometry, steric hindrance, and 
cohformatibnal flexibility. Collectively these findings indicate that DNA-templated synthesis is 
a gmeral phenomenon capable of supporting a range of reaction types, and is not limited to the 
creation of structures resembling nucleic acid backbones as described previously. 
[00124] Since sequence discrimination is important for the faithful translation of DNA into 
synthetic structures, tlie reaction rate of a matched reagent compared with that of a reagent 
bearing a single mismatched base near the center of its 10-base oligonucleotide was measured. 
At 25.®C, the initial rate of reaction of matched thiol reagents with iodoacetamide-iinked H 
templates is 200-fold faster than that of reagents bearing a single mismatch (A^pp = 2.4 x 10^ WT^s" 
^ vs. 1.1 X 10^ M^s"^, Fig. 7). In addition, small amounts of products arising from the annealing 
of mismatched reagents can be eliminated by elevating the reaction temperature beyond the Tm of 
the mismatched reagents (Fig. 7). The decrease in the rate of product formation as temperature 
is elevated further indicates that product formation proceeds by a DNA-templated mechanism 
rather than a simple intermolecular mechanism. 

[00125] In addition to reaction generality and sequence specificity^ DNA-templated synthesis 
also demonstrates remarkable distance independence. Both H and E templates linked to 
maleimide or a-iodbacetamide groups promote sequence-specific reaction with matched, but not 
mismatched, thiol reagents annealed anywhere on the templates examined thus far (up to 30 
bases away from the reactive group on the template). Reactants annealed one base away react 
with similar rates as tiiose annealed 2, 3, 4, 6, 8, 10, 15, 20, or 30 bases away (Fig. 8). In all 
cases, templated reaction rates are several hundred-fold higher than the rate of untemplated 
(mismatched) reaction (^app = lO'^-lO^ M'^s*^ vs. 5 x 10^ M"*s"^). At intervening distances of 30 
bases, products are efiicientiy formed presumably through transition states resembling 200- 
membered rings. These findings contrast sharply with the .well-known difficulty of 
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macrocyclization (see, for example, G. lUuminati et al Acc. Chem, Res. 1981, 14, 95-102; R. B. 
Woodward et al J. Am. Chem, Soa 1981, 103, 3210-3213) in organic synthesis. 
[00126] To determine the basis of the distance independence of DNA-tempiated synthesis, a 
series of modified E templates were first synthesized in which, the intervening bases were 
replaced by a series of DNA analogs designed to evaluate the possible contribution of (z) 
interbase interactions, (it) conformational preferences of the DNA backbone, (iii) the charged 
phosphate backbone, and (/v) backbone hydrophilicity. Templates in which the intervening 
bases were replaced with any of the analogs ia Fig. 9 had little effect on the rates of product 
formation. These findings indicate diat backbone structural elements specific to DNA are not 
responsible for the observed distance independence of DNA-templated synthesis. However, the 
addition of a 10-base DNA oligonucleotide "clamp" complementary to the single-stranded 
intervening region significantly reduced product formation (Fig. 9), suggesting that the flexibility 
of this region is critical to efficient DNA-templated synthesis. 

[00127] The distance independent reaction rates may be esqplained if the bond-forming events 
in a DNA-templated format are sufficiently accelerated relative to their nontemplated 
counterparts such that DNA annealing, ratlier than bond formation, is rate-determiciing. If DNA 
annealing is at least partially rate limiting, then the rate of product formation should decrease as 
the concentration of reagents is lowered because aimealing, unlike templated bond formation, is 
a bimolecular process. Decreasing the concentration of reactants in the case of the E template 
with one or ten intervening bases between reactive groups resulted in a marked decrease in the 
observed reaction rate (Fig. 10). This observation suggests that proximity effects in DNA- 
templated synthesis can enhance bond formation rates to the point that DNA annealing becomes 
rate-determining, 

[00128] These findings raise the possibility of using DNA-templated synthesis to translate . in 
one pot libraries of DNA into solution-phase libraries of synthetic molecules suitable for PCR 
amplification and selection. The ability of DNA-templated synthesis to support a variety of 
transition state geometries suggests its potential in directing a range of powerfiil water- 
compatible synthetic reactions (see, Li, C.J, Organic Reactions in Aqueous Media, Wiley and 
Sons, New York: 1997). The sequence specificity described above suggests that mixtures of 
reagents may be able to react predictably with complementary mixtures of templates. Finally, 
the observed distaiice independence suggests that different regions of DNA "codons" may be 
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used to encode different groups on the same synthetic scaffold without impairing reactions rates. 
As a demonstration of this approach, a library of 1,025 maieimide-linked templates was 
syntheisized, each with a different DNA sequence in an eight-base encoding region (Fig. 1 1). 
One of these sequences, 5'-TGACGGGT-3', was arbitrarily chosen to code for the attachment of 
a biotin group to the template, A library of thiol reagents linked to 1,025 different 
oligonucleotides was also generated. The reagent linked to 3'-ACTGCCCA-5' contamed a 
biotin group, while the other 1,024 reagents contained no biotin. Equimolar ratios of all 1,025 
templates and 1,025 reagents were mixed in one pot for 10 minutes at 25 and the resulting 
products were selected in viti o for binding to streptavidin. Molecules surviving the selection 
were amplified by PGR and analyzed by restriction digestion and DNA sequencing. 
100129] Digestion with the restriction endonuclease Tsp45l, which cleaves GTGAC and 
therefore cuts the biotin encoding template but none of the other templates, revealed a 1:1 ratio 
of biotin encoding to non-biotin encoding templates following selection (Fig. 12). This 
represents a 1,000-fold enrichment compared with the unselected library. DNA sequencing of 
the PGR amplified pool before and after selection suggested a similar degree of enrichment and 
indicated that the biotin-encoding template is the major product after selection and amplification 
(Fig. 12). The ability of DNA-templated synthesis to support the simultaneous sequence-specific 
reaction of 1,025 reagents, each of which faces a 1,024:1 ratio of non-partner to partner 
templates, demonstrates its potential as a method to create synthetic libraries in one pot. The * 
above proof-of-principle translation, selection, and amplification of a synthetic library member 
having a specific property (avidin affinity in tliis example) addresses several key requirements 
for the evolution of non-natural small molecule libraries toward desired properties. 
[00130] Taken together, these results suggest that DNA-templated synthesis is a surprisingly 
general phenomenon capable of directing, rather than simply encoding, a range of chemical 
reactions to form products unrelated in stmcture to nucleic acid backbones. For several reactions 
examined, the DNA-templated format accelerates the rate of bond formation beyond the rate of a 
10-base DNA oligonucleotide annealing to its complement, resulting in surprising distance 
independence. The fecile nature of long-distance DNA-templated reactions may also arise in 
part firom the tendency of water to contract the volume of nonpolar reactants {see, C.-J. Li et al. 
Organic Reactions in Aqueous Media, Wiley and Sons: New York, 1997) and fi^om possible 
compactness of the intervening single^stranded DNA between reactive groups. These findings 



33 of 88 



wo 02/074929 



PCTAJS02/08546 



may have implications for prebiotic evolution and for understanding the mechanisms of catalytic 
nucleic acids, which typically localize substrates to a strand of RNA or DNA. 
[00131] Methods: 

[00132] DNA synthesis. DNA oligonucleotides were synthesized on a PerSeptive 
Biosystems Expedite 8909 DNA synthesizer usmg standard protocols and purified by reverse 
phase HPLC. Oligonucleotides were quantitated spectrophotometrically and by denaturing 
polyacrylamide gel electrophoresis (PAGE) followed by staining with ethidium bromide or 
SYBR Green (Molecular Probes) and quantitation using a Stratagene Eagle Eye II densitometer. 
Phosphoramidites enabling the synthesis of 5'-NH2-dT, 5' tetrachlorofluorescein, abasic 
backbone spacer, C3 backbone spacer, 9-bond polyethylene glycol spacer, 12-bond saturated 
hydrocarbon spacer, and 5* biotin groups were purchased from Glen Research. Thiol-linked 
oligonucleotide reagents were synthesized on C3 disulfide controlled pore glass (Glen Research). 
[00133] Template functionalization. . Templates bearing 5*-NH2-dT groups were transformed 
into a variety of electrophilic fiznctional groups by reaction with the appropriate electrophile- 
NHS ester (Pierce). Reactions were performed in 200 mM sodium phosphate pH 7.2 with 2 
mg/mL electrophile-NHS ester, 10% DMSO. and up to 100 fig of S'-amino template at 25 ''C for 
1 h. Desired products were purified by reverse-phase HPLC and characterized by gel 
electrophoresis and MALDI mass spectrometry. 

[00134] DNA-tempIated synthesis reactions. Reactions were initiated by mixing equknolar 
quantities of reagent and template in buffer containing 50 mM MOPS pH 7.5 and 250 mM NaCl 
at the desired temperature (25 °C unless stated otherwise). Concentrations of reagents and 
templates were 60 nM unless otherwise mdicated. At various time pomts, aliquots were 
removed, quenched with excess p-mercaptoethanol, and analyzed by denaturing PAGE. 
Reaction products were quantitated by densitometry using their intrinsic fluorescence or by 
stabling followed by densitometry. Representative products were also verified by MALDI mass 
spectrometry. 

[00135] In vitro selection for avidin binding. Products of the library translation reaction 
were isolated by ethanol precipitation and dissolved in binding buffer (10 mM Tris pH 8, 1 M 
NaCl, 10 mM EDTA). Products were incubated wtih 30 \ig of streptavidin-linked magnetic 
beads (Roche Biosciences) for 10 min at room temperature in 100 uL total volume. Beads were . 
washed 16 times with binding buffer and eluted by treatment with 1 |imol fiee biotin in 100 uL 
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binding buffer at 70 °C for 10 minutes. Elated molecules were isolated by ethanol precipitation 
and amplified by standard PGR protocols (2 mM MgCb, 55 °C annealing, 20 cycles) using the 
primers S'-TGOTGCGGAGCCGCCG and 5'-CCACTGTCCGTGGCGCGACCCCGGCTCC 
TCGGCTCGG. Automated DNA sequencing used the primer 5'- 

CCACTGTCCGTGGCGCGACCC. 

[00136] DNA Sequences. Sequences not provided m the figures are as follows: matched 
reagent in Fig. iSb. SIAB and SBAP reactions: 5'-CCCGAGTCGAAGTCGTACC-SH; 
mismatched reagent in Fig, 6b SIAB and SBAP reactions: 5*-GGGCTCAGCTTCCCCATAA- 
SH; mismatched reagents for other reactions in Figs. 6b, 6c, 6d, and 8a; 5*-FAAATCTTCCC- 
SH (F= tetrachlorofluorescein); reagents m Figs. 6c and 6d containing one mismatch: 5- 
FAATTCTTACC-.SH; E templates in Figs. 6a, 6b SMCC, GMBS, BMPS, and SVSB reactions, 
and 8a: ' 5*-(NH2dT)- 

CGCGAGCGTACGCrCGCGATGGTACGAATTCGACTCGGGAATACCACCTTCGACT^ 
AGG; H template in Fig. 6b SIAB, SBAP,.and SIA reactions: 5'-(NH2dT)- CGCGAGCGTACG 
CTCGCG ATGGTACGAATTC; clamp oUgonucleotide inFig 8b: S'-ATTCGTACCA 

[00137] Example 2; Exemplary Reactions for Use in PNA-Templated Synthesis ; 
[00138] As discussed above, the generality of DNA-templated synthetic chemistry was 
examined (see, Liu et al 1 Am. Chetn. Soc. 2001, 123, 6961). Specifically, the abiUty of DNA- 
templated synthesis to direct a modest collection of chemical reactions without requiring the 
precise aligmnent of reactive groups into DNA-like confonnations was demonstrated. Indeed, 
the distance independence and sequence fidelity of DNA-templated synthesis allowed the 
sunultaneous, one-pot translation of a model library of more than 1,000 templates into the 
corresponding thioether products, one of which was enriched by in vitro selection for binding to 
the protein streptavidin and amplified by PGR. 

[00139] As described in detail herein, the generality of DNA-templated synthesis has been 
fiirther expanded and it has been demonstrated that a variety of chemical reactions can be 
utilized for the construction of small molecules and in particular, for the first time, DNA- 
templated organometalhc couplings and carbon-carbon bond forming reactions other than 
pyrimidine photodimerization. These reactions clearly represent an important step towards the in 
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vitro evolution of non-natural synthetic molecules by enabling the DNA-templated construction 
of a much more diverse set of structures than has previously been achieved. 
(00140] The ability of DNA-templated synthesis to direct reactions that require a non-DNA- 
linked activator, catalyst or other reagent in addition to the principal reactants has also been 
demonstrated herein. To test the ability of DNA-templated synthesis to mediate such reactions 
without requiring structural mimicry of the DNA-templated backbone, DNA-templated reductive 
aminations between an amine-linked template (1) and benzaldehyde- or glyoxal-linked reagents 
(3) with millunolar concentrations of NaBHaCN at room temperature in aqueous solutions can be 
perfonned. Significantly, products formed efficiently when the template and reagent sequences 
were complementary, while control reactions in which the sequence of the reagent did not 
complement that of the template, or m which NaBHaCN was omitted, yielded no significant 
product (see Figures 13 and 14). Although DNA-templated reductive aminations to generate 
products closely mimicking the structure of double-stranded DNA have been previously reported 
(see, for example, X. Li et al 1 Am. Ghent Soc, 2002, 124, 746 and Y. Gat et al Biopolymers 
1998, 48, 19), the above results demonstrate that reductive amination to generate structures 
unrelated to the phosphoribose backbone can take place efficiently and sequence-specifically. 
Referring to Figure 15, DNA-templated aide bond formations between amine-linked templates 4 
and 5 and carboxylate-linked reagents 6-9 mediated by l-(3-dimethylaminopropyl)-3- 
ethylcarbodiimide (EDC) and N-hydroxylsulfosuccinimide (sulfo-NHS) to generate amide 
products in good yields at pH 6.0, 25°C (Figure 15). Product formation was sequence-specific, 
dependent on the presence of EDC, and suprisingly insensitive to the steric encumbrance of the 
amine or carboxylate. Ef&cient DNA*templated amide formation was also mediated by the 
water-stable actiyator 4-(4,6-dimethoxy-l ,3,5-trizin-2-yl)-4-methylmorpholinium chloride 
(DMT-MM) instead of EDC and sulfo-NHS (Figures 14 and 15). The eflSciency and generality 
of DNA-templated amide bond formation under these conditions, together with the large number 
-of commercially available chiral amines and carboxylic acids, make this reaction an attractive 
candidate in future DNA-templated syntheses of structurally diverse small molecule libraries. 
[00141] It will be appreciated that carbon-carbon bond forming reactions are also important in 
both chemical and biological syntheses and thus several such reactions are utilized in DNA- 
templated format. Both the reaction of nitroalkane-linked reagent (10) with aldehyde-linked 
template (11) (nitro-aldol or Henry reaction) and the conjugate addition of 10 to maleimide- 
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linked template (12) (nitro-Michael addition) proceeded efficiently and with high sequence 
specificity at pH 7.5-8.5, 25°C (Figures 13 and 14). In addition, the sequence-specific DNA- 
templated Wittig reaction between stabilized phosphorus ylide reagent 13 and aldehyde-linked 
templates 14 or 11 provided the corresponding olefin products in excellent yields at pH 6.0-8.0, 
25°C (Figures 13 and 14). Similarly, the DNA templated 1,3-dipolar cycloaddition between 
nitrone-linked reagents 15 and 16 and olefin-hnked templates 12, 17 or 18 also afforded products 
sequence specifically at pH 7.5, 25*^C (Figures 13 and 14). 

[00142] In addition to the reactions described above, organometallic coupling reactions can 
also be utilized in the present invention. For example, DNA-templated Heck reactions were 
performed in the presence of water-soluble Pd precatalysts. In the presence of 170 mM 
NaaPdCUy aryl iodide-linked reagent 19 and a variety of olefin-linked templates including 
maleimide 12, acrylamide 17, vinyl sulfone 18 or cinnamamide '20 yielded Heck coupling 
products in modest yields at pH 5.0, 25°C (Figures 13 and 14). For couplings with olefms 17, 18 
and 20, adding two equivalents of P(p-S03C6H4)3 per equivalent of Pd prior to template and 
reagent addition' typically increased overall yields by 2-fold. Control reactions containing 
seqiience mismatches or lacking Pd precatalyst yielded no product. To our knowledge, the above 
DNA-templated nitro aldol addition, nitro Michael addition, Wittig olefination, dipolar 
cycloaddition, and Heck coupling represent the fust reported nucleic-acid templated 
organometallic reactions and carbon-carbon bond forming reactions other than pyrimidine 
photodimerization. 

[00143] It was previously discovered that tiie same DNA-templated reactions demonstrate 
distance independence, the ability to. form product at a rate independent of the number of 
intervening bases between annealed reactants. It was hypothesized (Figure 16a) that distance 
independence arises when tiie rate of bond formation in the DNA-templated reaction is;greater 
than the rate of template-reagent annealing. Although only a subset of chemistries fall into this . 
category, any DNA-templated reaction that affords comparable product yields when the reagent 
is annealed at various distances from the reactive end of the template is of special interest 
because it can be encoded at a variety of template positions. To evaluate the ability of the DNA- 
templated reactions developed above to take place eflficientiy v^en reactants are separated by 
distances relevant to library encoding, the yields of reductive amination, amide formation, nitro- 
aldol addition, nitrp-Michael addition, Wittig olefination, dipolar cycloaddition, and Heck 
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coupling when zero or ten bases separated annealed reactive groups (Figure 16a, n=0 versus 
n=10) were compared. Among tlae reactions described above or in previous work, amide bond 
fonnation, nitro-aldol addition, Wittig olefination, Heck coupling, conjugate addition of thiols to 
maleimides and Sn2 reaction between thiols and a-iodo amides demonstrate comparable product 
fonnation when reactive groups are separated by zero or ten bases (Figure 16b). These findings 
indicate that these reactions can be encoded dining synthesis by nucleotides that are distal from 
the reactive end of the template without significantly impahing product formation. 
[00144] In addition to the DNA-templated Sn2 reaction, conjugate addition, vinyl sulfone 
addition, amide bond formation, reductive amination, nitro-aldol (Henry reaction), nitro Michael, 
Wittig olefination, 1,3-dipolar cycloaddition and Heck coupling reactions described directiy 
above, a variety of additional reagents can also be utilized in the method of the present invention. 
For example, as depicted in Figure 17, powerfiil aqueous DNA-templated synthetic reactions 
including, but not limited to, the Lewis acid-catalyzed aldol addition, Mannich reaction, 
Robinson annulation reactions, additions of allyl indiuin, zinc and tin to ketones and aldehydes, 
Pd-assisted allylic substitution, Diels-Alder cycloadditions, and hetero-Diels-Alder reactions can 
be utilized efficiently in aqueous solvent and are important complexity-building reactions. 
[00145] Taken together, these results expand considerably the reaction scope of DNA- 
templated synthesis. A wide variety of reactions proceeded efficiently and selectively only when 
the corresponding reactants are programmed with complementary sequences. By augmenting the 
repertoire of known DNA-templated reactions to now include carbon-carbon bond forming and 
organometallic reactions (nitro-aldol additions, nitro-Michael additions, Wittig olefinations, 
dipolar cycloadditions, and Heck couplings) in addition to previously reported amide bond 
fomiation (see, Schmidt et al Nucleic Acids Res. 1997, 2J, 4792; Bruick et al . CherrL Biol 1996, 
3, 49), imine formation (Czlapinski et al 1 Am, Chem. Soc, 2001, 725, 8618), reductive 
anunation (Li et al J. Am. Chem. Soc. 2002, 124, 746; Gat et al Bioppfymers, 1998, 48, 19), Sn2 
reactions (Gartner etal J. Am. Chem. Soc. 2001, 725, 6961; Xu etal Nat. Bioiechnol 2001, 7P, 
148; Herrlein etal J. Am, Chem. Soc. 1995, 777, 10151) conjugate addition of thiols (Gartner et 
d. J. AfTL Chem. iSbc. 2001, 725, 6961), and phosphoester or phosphonamide formation (Orgel et 
al. Acc. Chem. Res. 1995, 25, 109; Luther et al Nature, 199S, 396, 245), these results may 
enable the sequence-specific translation of libraries of DNA into libraries of structurally and 
fiinctionally diverse synthetic products. Since, minute quantities of template^ encoding desired 
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molecules can be amplified by PGR, the yields of DNA-templated reactions are arguably less 
critical .than the yields of traditional synthetic transformations. Nevertheless, many of the 
reactions developed above proceed ejBSciently. In addition, by demonstrating that DNA- 
templated synthesis in the absence of proteins can direct a large diversity of chemical reactions, 
these findings support previously proposed hypotheses that nucleic acid-templated synthesis rriay 
have translated replicable information into some of the earliest fimctional molecules such as 
polyketides, terpenes and polypeptides prior to the evolution of protein-based en2ymes. The 
diversity of chemistry shown here to be controllable sunply by bringing reactants iiito proximity 
by DNA hybridization vdthout obvious structural requirements provides an experimental basis 
for these possibilities. The translation of amplifiable information into a wide range of structures 
is a key requirement for applying nature's molecular evolution approach to the discovery of non- 
natural molecules with new fimctiotis. 

[00146] Methods for Exemplary Reactiofis for Use in DNA-Templated Syntltesis: 
[00147] Functionalized templates and reagents were typically prepared by reacting 5'-NH2 
terminated oligonucleotides (for template 1), 5'-NH2-(CH20)2 terminated oligonucleotides (for 
all odier templates) or 3'-OP03-CH2CH(CH20H)(CH2)4NH2 tenninated nuclotides (for all 
reagents) with the appropriate NHS esters (0.1 volumes of a 20 mg/mL solution in DMF) in 0.2 
M sodium phosphate buffer, pH 7.2, 25*'C, 1 h to provide the template and reagent structures 
shown in Figures 13 and 15. For amino acid linked reagents 6-9, 3'- 
OP03CH2CH(CH20H)(CH2)4NH2 terminated oligonucleotides in 0;2 M sodium phosphate 
buffer, pH 7.2 were reacted vwth O.l volumes of a 100 mM bis[2- 
(succimmidyloxycarbonyloxy)etiiyl]sulfone (BSOCOES, Pierce) solution in DMF for 10 min at 
25°C, followed by 0.3 volumes of a 300 mM amino acid in 300 mM NaOH for 30 min at 25°C. 
[00148] Functionalized templates and reagents were purified by gel filtration using Sephadex 
G-25 followed by reverse-phase HPLC (0.1 triethylammoniimi acetate-acetonitrile gradient) and 
characterized by MALDI mass spectrometry. DNA templated reactions were conducted under 
the conditions described in Figures 13 and IS and products were characterized by denaturing 
polyacrylamide gel electrophoresis and MALDI mass spectrometry. 

[00149] The sequences of oligonucleotide templates and reagents are as follows (5* to 3' 
direction, n refers to the number of bases between reactive groups when template and reagent are 
annealed as shown in Figure 16). 1: TGGTACGAATTCGACTCGGG; 2 and 3 matched: 
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GAGTCGAATTCGTACC; 2 aad 3 mismatched: GGGCTCAGCTTCCCCA; 4 and 5: 
GGTACGAATTCGACTCGGGAATACCACCTT; 6-9 matched (n = 10): TCGCGAGTCG; 6 
matched (n - 0): AATTCGTACC; 6-9 mismatched: TCACCTAGCA; 11, 12, 14, 17, 18, 20: 
GGTACGAATTCGACTCGGGA; 10, 13, 16, 19 matched: TCCCGAGTCGAATTCGTACC; 
10, 13 16, 19 mismatched: GGGCTCAGCTTCCCCATAAT; 15 matched: AATTCGTACC; 15 
mismatched: TCGTATTCCA; template for n = 10 vs. n = 0 comparison: 
TAGCGATTACGGTACGAATTCGACTCGGGA 

[00150] Reaction yields quantitated by denaturing polyacrylamide gel electrophoresis 
followed by ehidium bromide staining, UV visualization, and CCD-based densitometry of 
product and template starting material bands. Yield calculations assumed that templates and 
products stained with equal intensity per base; for those cases in which products are partially 
double-stranded during quantitation, changes in staining intensity may result in higher apparent 
yields. 

[00151] Example 3; Development of Exemplary Linkers 

[00152] As will be appreciated by one of ordinary skill in the art, it is frequently useful to 
leave the DNA moiety of the reagents linlced to products during reaction development to 
facilitate analysis by gel electrophoresis. The use of DNA-templated synthesis to translate 
libraries of DNA into corresponding libraries of synthetic small molecules suitable for in vitro 
selection, however, requires the development of cleavable linkers connecting reactive groups of 
reagents with their decoding DNA oligonucleotides. As described below and herein, three 
exemplary types of linkers have been developed (see. Figure 18). For reagents with one reactive 
group, it would be desirable to position DNA as a leaving group to the reactive moiety. Under 
this "autocleavable" linker strategy, the DNA-reactive group bond is cleaved as a natural 
consequence of the reaction. As but one example of this approach, a fluorescent Wittig 
phosphorane reagent (14, referring to Figure 19) was synthesized in which the decoding DNA 
oligonucleotide was attached to one of the aryl phosphine groups (see. Figure 19, left). DNA- 
templated Wittig reaction with aldehyde-linked templates resulted in the nearly quantitative 
transfer of the fluorescent group from the Wittig reagent to the template and the concomitant 
liberation of the alkene product from the DNA moiety of the reagent. Additionally, reagents 
bearing more than one reactive group can be linked to theu: decoding DNA oligonucleotides 
through one of two additional linker strategies. In the "scarless" linker strategy, DNA-templated 
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leactioa of one reactive group is followed by cleavage of the linker attached through a second 
reactive group to yield products without leaving behind additional chemical functionality. For 
example, a series of amino acid reagents were syntheisized which were cormected through a 
carbamoylethylsulfone linker to their decoding DNA oligonucleotides (Figure 19, center). 
Products of DNA-templated amide bond formation using these amino acid reagents were treated 
with aqueous alkalme buffer to effect the quantitative elimination and spontaneous 
decarboxylation of the carbamoyl group. The product of leaving this scarless linker is therefore 
the cleanly transferred amino acid moiety. In yet other embodiment of the mvention, a third 
linker strategy, a "usefid scar^' may be utilized on the theory that it may be advantageous to 
introduce useful chemical groups as a consequence of linker cleavage. In particular, a "usefid 
scar" can be functionalized in subsequent steps and is left behind following linker cleavage. For 
example, amino acid reagents linked through 1,2-diols to their decoding DNA oligonucleotides 
were generated. Following amide bond formation, this linker was quantitatively cleaved by 
oxidation with NaI04 to afford products bearing useful aldehyde groups (see. Figure 19, right). 
In addition to the linkers described directly above, a variety of additional linkers can be utilized. 
For example, as shown in Figure 20, a thioester linker can be generated by carbodiimide- 
mediated coupling of thiol-tenninated DNA with carboxylate-containing reagents and can be 
cleaved with aqueous base. As the carboxylate group provides entry into the DNA-templated 
amide bond formation reactions described above, this linker would liberate a "useful scar" when 
cleaved (see, Figure 20). Alternatively, the thioester linker can be used as an autocleavable 
linker during an amine acylation reaction in the presence of Ag(I) cations (see, Zhang et al 1 
Am. Chem, Soc. 199% 121, 331 1-3320) since the tiiiol-DNA moiety of the reagent is liberated as 
a natural consequence of the reaction. It will be appreciated that a thioether linker that can be 
oxidized and eliminated at pH 11 to liberate a vinyl sulfone can be utilized as a "useful scar" 
linker. As demonstrated herein, the vinyl sulfone group serves as the substrate in a number of 
subsequent DNA-templated reactions. 

[00153] Example 4; Exemplary Reactions in Organic Solvents: 

[00154] As demonstrated herein, a variety of DNA-templated reactions can occur in aqueous 
media. It has also been demonstrated, as discussed below, that DNA-templated reactions can 
occur in organic solvents, thus greatly expanding the scope of DNA-templated synthesis. 
Specifically, DNA templates and reagents have been coraplexed with long chain 
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tetraalkylammonium cations (see, Jost et al Nucleic Acids Res, 1989, 77, 2143; Mernikov et al 
Langmuir, 1999, 75, 1923-1928) to enable quantitative dissolution of reaction components in 
anhydrous organic solvents including CH2CI2, CHCI3, DMF and MeOH. Surprisingly, it was 
found that DNA-templated synthesis can indeed occur in anhydrous organic solvents with high 
sequence iselectivity. Depicted in Figure 21 are DNA-templated amide bond formation 
reacations in v^ch reagents and templates are complexed with dimethyldidodecylammonium 
cations either in separate vessels or after preaimealing in water, lyophilized to dryness, dissolved 
in CH2CI2, and mixed together. Matched^ but not mismatched, reactions provided products both 
when reactants were preannealed in aqueous solution and when they were mixed for the first 
time in GH2CP (see. Figure 21). DNA-teinplated amide formation and Pd-mediated Heck 
coupling in anhydrous DMF also proceeded sequence-specifically. Clearly, these observations 
of sequence-specific DNA-templated synthesis in organic solvents implies the presence of at 
least ^some secondary structure within tetraalkylammonium-complexed DNA in organic media, 
and should enable DNA receptors and catalysts to be evolved towards stereoselective bindmg or 
catalytic properties in organic solvents. Specifically, DNA-templated reactions that are known to 
occur in aqueous media, includiag conjugate additions, cycloadditions^ displacement reactions, 
and Pd-mediated couplings can also be performed in organic solveiits. In certain othisr 
embodiments, reactions in organic solvents may be utilized that are inefficient or impossible to 
perform in water. For example, while Ru-catalyzed olefin metathesis in water has been reported 
by Grubbs and co-workers (see, Lynn et al J. Ant Chem, Soc, 1998, 120, 1627-1628; Lynn et al 
J. Am, ChetYL Soc. 2000, 722, 6601-6609; Mohr etal Organometallics 1996, 75, 4317-4325), the 
aqueous metathesis system is extremely fiinctional group sensitive. The fimctional group 
tolerance of Ru-catalyzed olefin metathesis in organic solvents, however, is significantiy more 
robust. Some exemplary reactions to utilize in organic solvents include, but are not limited to 
1,3-dipolar cycloaddition between nitrones and olefins which can proceed through transition 
states that are less polar than ground state starting materials. 

(00155] As detailed above, the generality of DNA-templated synthesis has been established by 
performing several distinct DNA-templated reaction types, none of which are limited to 
producing structures that resemble the natural nucleic acid backbone, and many of which are 
highly usefiil carbqn-carbon bond forming or complexity-building synthetic reactions. It has 
been shown that the distance independence of DNA-templated synthesis allows different regions 
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of a DNA template to each encode different synthetic reactions. DNA-templated synthesis can 
piaiptain sequence fidelity even in a library format in which more than 1,000 templates and 
1,000 reagents react simultaneously in one pot As described above and below, linker strategies 
have been developed, which together with the reactions developed as described above, have 
enabled the first multi-step DNA-templated synthesis of simple synthetic small molecules. 
Additionally, the sequence-specific DNA-templated synthesis in organic solvents has been 
delnonstrated,. further expanding the 'scope of this approacL 

[00156] Example 5; Synthesis of Exemplary Compounds and Libraries of Compounds; 
(00157] A) Synthesis of a Pofycarbamate Library: One embodiment of the strategy 
described above is the creation of an amplifiable polycarbamate library. Of the sixteen possible 
dinucleotides used to encode the library^ one is assigned a start codon Amotion, and one is 
assigned to serve as a stop codon. An artificial genetic code is then created assigning each of the 
up to 14 remaining dinucleotides to a different monomer. For geometric reasons one monomer 
actually contains a dicarbamate containing two side chains. Within each monomer, the 
dicarbamate is attached to the corresponding dinucleotide (analogous to a tRNA anticodon) 
through a silyl enol ether linker which liberates the native DNA and the firee carbamate upon 
treatment with fluoride. The dinucleotide moiety exists as the activated 5'-2-methylimidazole 
phosphate, that has been demonstrated (Inoue et al J. Mol. Biol 162:201, 1982; Rembold et al. 
1 Mol Evol 38:205, 1994; Chen etal X Mol Biol 181:271, 1985; Acevedo etal J, Mol Biol 
197:187, 1987; Inoue et al J. Am. Chem. Sac, 103:7666, 1981; each of which is incorporated 
herein by reference) to serve as an excellent leaving group for template-directed oligomerization 
of nucleotides yet is relatively stable under neutral or basic aqueous conditions (Schwartz et al 
Science 228:585, 1985; incorporated herein by reference). The dicarbamate moiety exists in a 
cyclic form linked through a vinyloxycarbonate linker. The vinylcarbonate group has been 
demonstrated to be stable in neutral or basic aqueous conditions (Olofson et al Tetrahedron Lett. 
18:1563, 1977; Olofson et al Tetrahedron Lett. 18:1567, 1977; Olofson et al Tetrahedron Lett 
18:1571, 1977; each of which is incorporated herein by reference) and ftirther has been shown to 
provide carbamates in very high yields upon the addition of amines (Olofson et al. Tetrahedron 
Lett. 18:1 563, 1 977; incorporated herem by reference). 



43 of 88 



wo 02/074929 



PCTAJS02/08546 



forDNA < hr ^ 



Me 

leaving group I 




polymerization 



accepts nucleophile 



for carbamate 
polymerization; 
releases a new 



nucleophile when 
attacked 



[00158] When attacked by an amine j&om a nascent polycarbamate chain, the vinyl carbonate 
linker, driven by the aromatization of /w-cresol, liberates a ftee amine. This free amine 
subsequently serves as the nucleophile to attack the next vinyloxycarbonate, propagating the 
polymerization of tlie growing carbamate chain. Such a strategy minimizes the potential for 
cross-reactivity and bi-directional polymerization by ensuring that only one nucleophile is 
present at any time during polymerization. 

[00159] Using the monomer described above, artijBcial translation of DNA into a 
polycarbamate can be viewed as a threcrstage process. In the first stage, single stranded DNA 
templates encoding the library are used to guide the assembly and polymerization of the 
dinucleotide moieties of the monomers, terminating with the "stop" monomer which possesses a 
3 'methyl ether instead of a 3 'hydroxyl group (Figure 22). 

[00160] Once the nucleotides have assembled and polymerized into double-stranded DNA, 
the "start" monomer ending in a (j-nitrobenzylcarbamates is photodeprotected to reveal the 
primary amme that initiates carbamate polymerization. Polymerization proceeds in the 5' to 3' 
direction along the DNA backbone, with each nucleophilic attack resulting in the subsequent 
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unmasking of a new amine nucleophile,* Attack of the "stop" monomer liberates an acetamide 
rather than an amine, thereby, termination polymerization (Figure 23). Because the DNA at this 
stage exists in a stable double-stranded form, variables such as temperature and pH> may be 
wcplored to optimize polymerization efficiency. 

[00161] Following polymerization the polycarbamate is cleaved from the phosphate backbone 
of the DNA upon treatment with fluoride. Desilylation of the enol ether linker and the 
elimination of the phosphate driven by the resulting release of phenol provides the 
polycarbamate covalently linked at its carboxy terminus to its encoding single-stranded DNA 
(Figure 24). 

[00162] At this stage the polycarbamate may be completely , liberated from the DNA by base 
hydrolysis of the ester linkage. The liberated polycarbamate can be purified by HPLC and 
letested to verify that its desired properties are intact. The free DNA can be amplified using 
PGR, mutated with error-prone PGR (Cadwell et al PCR Methods AppL 2:28, 1992; 
incorporated herein by reference) or DNA shuffling (Stemmer Proc. Natl Acad. Sci USA 
91:10747, 1994; Stenuner Nature 370:389, 1994; U.S. Patent 5,811,238, issued September 22, 
1998; each of which is incorporated herein by reference), and/or sequenced to reveal the primary 
structure of the polycarbamate. 

[00163] Synthesis of monomer units. After the monomers are synthesized, the assembly and 
polymerization of the monomers on the DNA scaffold should occur spontaneously. Shikimic 
acid 1, available commercially, biosynthetically (Davis Adv, Enzymol 16:287, 1955; 
incorporated herein by reference), or by short syntheses from D-maxmose (Fleet et al J. Chem. 
Soc, Perkins Trans. 7905, 1984; Harvey et al Tetrahedron Lett. 32:4111, 1991; each of which 
is incorporated herein by reference), serves as a convenient starting point for the monomer 
synthesis^ The syn hydroxyl groups are protected as the ;?-methoxybenzylidene, and remaining 
hydroxyl group as the /er^-butyldimethylsilyl ether to afford 2. The carboxylate moiety of the 
protected shikimic acid is then reduced completely by LAH reduction, tosylation of the resulting 
alcohol, and further reduction widi LAH to provide 3. 
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[00164] Commercially available and synthetically accessible N-protected amino acids serve as 
the starting materials for the dicarbamate moiety of each monomer. Reactive side chains are 
protected as photolabile ethers, esters, acetals, carbamates, or thioethers. Following chenustry 
previously developed (Cho et al Science 261:1303, 1993; incorporated herein by reference), a 
desired amino acid 4 is converted to the corresponding amino alcohol 5 by mixed anhydride 
formation with isobutylchloroformate followed by reduction with sodium borohydride. The 
amino alcohol is then converted to the activated carbonate by treatment vnth p- 
nitrophenylchloroformate to afford 6, which is then coupled to a second amino alcohol 7 to 
provide, following hydroxyl group silylation and FMOC deprotection, carbamate 8. 




,NHFMOC 



[00165] Coupling of carbamate 8 onto the shikimic acid-derived linker proceeds as follows. 
The allylic hydroxyl group of 3 is deprotected with TBAF, treated with triflic anhydride to form 
the secondary triflate, then displaced with aminocarbamate 8 to afford 9. Presence of the vinylic 
methyl group in 3 should assist in minimizing the amount of xmdesired product resulting from 
Sn2' addition (Magid Tetrahedt^on 36:1901, 1980; incorporated herein by reference), Michael, 
additions of deprotonated carbamates to a,p-unsaturated esters have been well documented 
(Collado et al Tetrahedron Lett, 35:8037, 1994; Hirama et al. J. Am. Chem. Soc. 107:1797, 
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1985; Nagasaka et al Heterocycles 29:155, 1989; Shishido et al J. Chem, Soc, Perkins Trans. I 
993, 1987; Hirama aL .Heterocycles 28:1229, 1989; each of which is incorporated herein by 
reference). By analogy, the secondary amine is protected as the o-nitrobenzyl carbamate 
(NBOC), and the resulting compound is deprotonated at the carbamate nitrogen. This 
deprotonation can typically be performed with either sodium hydride or potassium tert- 
butyloxide (Collado et al Tetrahedron Lett, 35:8037, 1994; Hirama et al J. Am, Chenu Soc. 
107:1797, 1985; Nagasaka et al Heterocycles 29:155, 1989; Shishido et al J, Chem, Soc. 
Perkins Trans. I 993, 1987; Hirama et al Heterocycles 28:1229, 1989; each of which is 
incorporated herein by reference), although other bases may be utilized to minimize 
deprotonation of the nitrobenzylic protons. Additions of the deprotonated carbamate to a,P- 
unsaturated ketone 10, followed by trapping of the resulting enolate with TBSCl, should afford 
silyl enol ether 11. The previously found stereoselectivity of conjugate additions to 5-substituted 
enones such as 10 (House et al J. Org, Chenu 33:949, 1968; Still et al Tetrahedron 37:3981, 
1981; each of which is mcorporated herein by reference) suggests that preferential formation of 
11 over its diastereomer. Ketone 10, the precursor to the -fluoride-cleavable carbamate- 
phosphate linker, may be synthesized from 2 by one pot decarboxylation (Barton et al 
Tetrahedron 41:3901, 1985; incorporated herein by reference) followed by treatment with 
TBAF, Swem oxidation of the resultiag alcohol to afford 12, deprotection with DDQ, selective 
nitrobeniyl ether formation of the less-hindered alcohol, and reduction oif the a-hydroxyl group 
with samarium iodide (Molander In Organic Reactions, Paquette, Ed. 46:21 1, 1994; incorporated 
herein by reference). 
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[00166] The p-methoxybenzylidiene group of 11 is transfomied into the a-hydroxy PMB ether 
using sodium cyanoborohydride and IMS chloride (Johansson et al J. Chem. Soc, PerJdn Trans. 
1 2371, 1984; incorporated herein by reference) and the TES group deprotected with 2% HF 
(conditions that should not affect the TBS ether (Boschelli et al. Tetrahedron Lett. 26:5239, 
1985; incorporated herein by reference)) to provide 13. The PMB group, following precedent 
(Johansson et al 1 Chem. Soc, Perkin Traits. 72371, .1984; Sutherlin et al Tetrahedron Lett, 
34:4897, 1993; each of which is incorporated herein by reference), should remain on the more 
hindered secondary alcohoL The two free hydroxyl groups may be macrocyclized by very slow 
addition of 13 to a solution of p-nitrophenyl chloroformate (or another phosgene analog), 
providing 14. The PMB ether is deprotected, and the resulting alcohol is converted into a triflate 
and eliminated under kinetic conditions with a sterically hindered base to afford 
vinyloxycarbonate 15. Photodeprotection of the nitrobenzyl either and nitrobenzyl carbamate 
yields alcohol 16. 
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[00167] The monomer synthesis is completed by the sequential coupling of three components. 
Chlorodiisopropylaminophosphine 17 is synthesized by the reaction of PCI3 with- 
diisopropylamine (King et al J. Org. Ghent 49:1784, 1984; incorporated herem by reference). 
Resin-bound (or 3'-(?-nitrobenzylether protected) nucleoside 18 is coupled to 17 to afford 
phosphoramidite 19. Subsequent coupling of 19 with the nucleoside 20 (Inoue et al J, Am, 
Chem, Soc. 103:7666, 1981; incorporated herein by reference) provides 21. Alcohol 16 is then 
reacted with 21 to yield, after careful oxidation using MCPBA or h followed by cleavage from 
the resin (or photodeprotection), the completed monomer 22. This strategy of sequential 
coupling of 17 with alcohols has been successfully used to generate phosphates bearing three 
different alkoxy substituents in excellent yields (Baimwarth et al Helv. Chim, Acta 70:175, 
1987; incorporated herein by reference). 
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[00168] The unique start and stop monomers used to initiate and terminate carbamate 
polymerization may be syntliesized by simple modification of the aboye scheme. 
[00169] B) Evolvable Functionalized Peptide-Niicleic Acids (PNAs): In another 
embodiment an amplifiable peptide-nucleic acid library is created. Orgel and co-workers have 
demonstrated that peptide-nucleic acid (PNAs) oligomers are capable of efficient polymerization 
on complementary DNA or RNA templates (Bohler et al. Nature 376:578, 1995; Schmidt et al 
NucL Acids Res, 25:4792, 1997; each of which is incorporated herein by reference). This 
finding, together with the recent synthesis and characterization of chiral peptide nucleic acids 
bearing amino acid side chains (Haaima et al. Angew: Chem. Int. Ed Engl 35:1939-1942, 1996; 
Pilschl et al Tetrahedron Lett 39:4707, 1998; each of wliich is incorporated herein by 
reference), allows the union of the polymer backbone and the growing nucleic acid strand iuto a 
single structure. In this example, each template consists of a DNA hairpin terminating in a 5' 
amino group; the solid-phase and solution syntheses of such molecules have been previously 
described (Uhlmann et al Angew. Chem. Int. Ed. Engl 35:2632, 1996; incorporated h^ein by 
reference). Each extension monomer consists of a PNA trimer (or longer) bearing side chains 
contaimng functionality of interest. . An artificial genetic code is written to assign each 
trinucleotide to a different set of side chains. Assembly, activation (with a carbodiimide and 
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appropriate leaving group, for example), and polymerization of the PNA dimers along the 
complementary DNA template in the carboxy- to amino-terminal direction affords the unnatural 
polymer (Figure 20). Choosing a "stop" monomer with a biotinylated N-terminus provides a 
convenient way of terminating the extension and purifying full-length polymers. The resulting 
polymers, covalently linked to their encoding DNA, are ready for selection, sequencing, or 
mutation. 

[00170] The experimental approach towards miplementing an evolvable functionalized 
peptide nucleic acid library comprises (i) improving and adapting known chemistry for the high 
efficiency template-directed polymerization of PNAs; (ii) defining a codon format (length and 
composition) suitable for PNA coupling of a number of diverse monomers on a complementary 
strand of encoding DNA free j&om significant infidelity, firamshifting, or spurious initiation of 
polymerization; (iii) choosing an initial set of side chmns defining our new genetic code and 
synthesizing corresponding monomers; and (iv) subjecting a library of functionalized PNAs to 
cycles of selection, amplification, and mutation and characterizing the resulting evolved 
molecules to understand the basis of their novel activities. 

[00171] (i) Improving coupling chemistry: While Orgel and coworkers have reported 
template-directed PNA polymerization, reported yields and number of successful couplings are 
significantiy lower than would be desired. A promising route towards improving this key 
couplirg process is exploring new coupling reagents, temperatures, and solvents which were not 
previously investigated (presumably because previous eiBForts focused on conditions which could 
have existed on prebiotic earth). The development of evolvable functionalized PNA polymers 
involves employmg activators (DCC, DIG, EDC, HATU/DIEA, HBTU/DIEA, ByBOP/DIEA, 
chloroacetonitrile), leaving groups (2-methylimidazoIe, imidazole, pentafluorophenol, phenol, 
thiophenol, trifluoroacetate, acetate, toluenesulfonic acids, coenzyme A, DMAP, ribose), 
solvents (aqueous at several pH values, DMF, DMSO, chlorofonn, TFE), and temperature (0°C, 
4**C, 25°C, 3TC, 55°C) in a large combinatorial screen to isolate new couplmg conditions. Each 
well of a 384-well plate is assigned a specific combination of one activator, leaving group, 
solvent, and temperature. Solid-phase synthesis beads covalentiy linked to DNA hsdrpin 
templates are placed in each well, together witii a fluorescentiy labeled PNA monomer 
complementary to the template. A successful coupling event results in the covalent linking of 
the fluorophore to the beads (Figure 26); undesired non-templated coupling can be distinjguished 
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by control reactions with mismatched monomers. Following bead washing and cleavage of the 
product from solid support, each well is assayed with a fluorescence plate reader. 
[00172] (ii) Defining a codon format: While Nature has successfully employed a triplet 
codon in protein biosynthesis, a new polymer assembled under very different conditions without 
the assistance of enzymes may require an entirely novel codon format Frameshifting may be 
remedied by lengthening each codon such that hybridizmg a codon out of firame guarantees a 
mismatch (for example^ by starting each codon with a G and by restricting subsequent positions 
in the codon to T, C, and A). Thermodynamically, one would also expect fidelity to improve as 
codon length increases to a certain point Codons that are excessively long, however, will be 
able to hybridize despite mismatched bases and moreover complicate monomer synthesis. An 
optimal codon length for high fidelity artificial translation can be defined using an optimized 
plate-based combinatorial screen developed above. The lengdi and composition of each codon in 
the template is varied by solid-phase synthesis of the appropriate DNA hairpin. These template 
hairpins are then allowed to couple with fluorescently labeled PNA monomers of varying 
sequence. The ideal codon format allows only monomers bearing exactly complementary 
sequences to couple with templates, even in the presence of mismatched PNA monomers (which 
are labeled differently to facilitate assaying of matched versus mismatched coupling). Triplet 
and quadruplet codons in which two bases are varied among A, T, and C while the remaining 
base or bases are fixed as G to ensure proper registration during polymerization are first studied. 
[00173] (in) Writing a new genetic code: Side chains are chosen which provide interesting 
functionality not necessarily present in natural biopolymers, which are synthetically accessible, 
and which are compatible with coupling conditions. For example, a simple genetic code which 
might be used to evolve a Ni^^ chelating PNA consists of a variety of protected carboxylate- 
bearing side chains as well as a set of small side chains to equip polymers with conformational 
flexibility and structural diversity (Figure 27). Successful selection of PNAs capable of binding 
Ni*^^ with high afBnity could be followed by an expansion of this genetic code to include a 
fluorophore as well as a fluorescent quencher. The resulting polymers could then be evolved 
towards a fluorescent Ni"^^ sensor which possesses different fluorescent properties in the absence 
or presence of nickel. Replacing the fluorescent side chain with a photocaged one may allow the 
evolution of polymers that chelate Ni"*"^ in the presence of certaui wavelengths of light or which 
release Ni*^ upon photolysis. These simple examples demonstrate the tremendous flexibility in 
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potential chemical properties of evolvable unnatural molecules conferred by the freedom to 
incorporate synthetic building blocks no longer limited to those compatible with the ribosomal 
machinery. 

[00174] (iv) Selecting for desired unnatural polymers: Many of the methods developed for 
the selection of biological molecules can be applied to selections for evolved PNAs with desired 
properties. Like nucleic acid or phage-display selections, libraries of unnatural polymers 
generated by the DNA-templated polymerization methods described above are self-tagged and 
therefore do not need to be spatially separated or synthesized on pins or beads. Ni"^^ binding 
PNA may be done simply by passing the entire library resulting from translation or a random 
oligonucleotide through commercially available Ni-NTA C*His-Tag") resin precharged with 
nickel. Desired molecules bind to the resin and are eluted vsdth EDTA. Sequencing these PNAs 
after several cycles of selection, mutagenesis, and amplification reveals vMch of the initially 
chosen side chains can assemble together to form a Ni^^ receptor. In addition,' the isolation of a 
PNA Ni^^ chelator represents the PNA equivalent of a histidine tag which may prove usefrd for 
the purification of subsequent unnatural polymers. Later efforts will involve more ambitious 
selections. For example, PNAs that fluoresce in the presence of specific ligands may be selected 
by FACS sorting of translated polymers linked through their DNA templates to beads. Those 
beads that fluoresce in the presence, but not in the absence, of the target ligand are isolated and 
characterized. Finally, the use of a biotinylated "stop" monoiner as described above allows for 
the durect selection for the catalysis of many bond-fonning or bond-breaking reactions. Two 
examples depicted in Figure 28 outline a selection for a fimctionalized PNA that catalyzes the 
retroaldol cleavage of fructose 1,6-bisphdsphate to glyceraldehyde 3-phosphate and 
dihydroxyacetone phosphate, an essential step in glycolysis, as well as a selection for PNA that 
catalyzes the converse aldol reaction. 

[00175] C) Evolvable Libraries of Small Molecules: In yet another embodiment of the 
present invention, the inventive methods are used in preparing amplifiable and evolvable 
unnatural nonpolymeric molecules including synthetic drug scajBFolds. Nucleophilic or 
electrdphilic groups are individually immasked on a small molecule scaffold attached by simple 
covalent linkage or through a conunon solid support to an encoding oligonucleotide template. 
Electrophilic or nucleophilic reactants linked to short nucleic acid sequences are hybridized to 
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the corresponding templates. Sequence-specific reaction with the appropriate reagent takes place 
by proximity catalysis (Figure 29). 

[00176] Following synthetic fiinctionalization of all positions in a manner determined by the 
sequence of the attached DNA (Figures 30 & 31), the resulting encoded beads may be subjected 
to wide range of biological screens which have been developed for assaying compounds on resin. 
(Gordon et al J. Mei Chem 37:1385, 1994; Gallop et al J. Med Chem, 37:1233-1251, 1994; 
each of which is incorporated herein by reference) 

[00177] Encoding DNA is cleaved from each bead identified in the screen and subjected to 
PGR, mutagenesis, sequencing, or homologous recombination before reattachment to a solid 
support Ultimately, this system is most flexible when the encoding DNA is directly linked to 
the combinatorial synthetic scaffold without an intervening bead. In this case, entire libraries of 
compounds may be screened or selected for desired activities, their encoding DNA liberated, 
amplified, mutated, and recombined, and new compounds synthesized all in a small series of 
one-pot, massively parallel reactions. Without a bead support, however, reactivities of 
hybridized reactants must be highly efficient since only one template molecule directs the 
synthesis of the entire small molecule. 

[00178] The development of evolvable synthetic small molecule libraries relies on chemical 
catalysis provided by the proximity of DNA hybridized reactants. It will be appreciated that 
acceptable distances between hybridized reactants and unmasked reactive groups must first be 
defined for efficient DNA-templated fiinctionalization by hybridizing radiolabeled electrophiles 
(activated esters in out first attempts) attached to short oUgonucleotides at varying distances from 
a reactive nucleophile (a prunary amine) on a strand of DNA. At given timepoints, aliquots are 
subjected to gel electrophoresis and autoradiography to monitor the course of the reaction. 
Plotting the ration as a fimction of the distance (in bases) between the nucleophile and 
electrophile will define an acceptable distance window within which proximity-based catalysis 
of a DNA-hybridized reaction can take place. The width of this window ^dll determine the 
* number of distinct reactions we can encode on a strand of DNA (a larger window allows more 
reactions) as well as the nature of the codons (a larger window is required for longer codons) 
(Figure 32). 

[00179] Once acceptable distances between fimctional groups on a combinatorial synthetic 
scaffold and hybridizes reactants is determined, the codon format is determined. The 
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nonpolymeric nature of small mol^ule synthesis simplifies codon reading as fiameshifting is not 
an issue and relatively large codons may be used to ensure that each set of reactants hybridizes 
only to one region of the encoding I)NA strand. 

[00180] Once the distance of the linker between the functional group and synthetic small 
molecule scaffold and the codon format have been detennined, one can synthesize small 
molecules based on a small molecule scaffold such as the cephalosporin scaffold shown in 
Figure 31. The primary amine of 7-aminocephalosporanic acid is first protected using FMOC- 
Cl, and then the acetyl group is hydrolyzed by treatment with base. The encoding DNA template 
is then attached through an amide linkage using EDC and HOBt to the carboxylic acid group. A 
transfer molecule with an anti-KX>don capable of hybridizing to the attached DNA template is 
then allowed to hybridize to the template. The transfer molcule has associated with it through a 
disulfide linkage a primary amine bearing Rj. Activation of the primary hydroxyl group on the 
cephalosporin scaffold with DSC following treatment with TCEP affords the amine covalently 
attached to the scaffold through a carbamate linkage. Further treatment with another transfer 
unit having an amino acid leads to functionaliztion of the deprotected primary amine of the 
cephalosporin scaffold. Cephalosporin-like molecules synthesized in this manner may then be 
selected, amplified, and/or evolved using the inventive methods and compositions. The DNA 
template may be diversified and evolved using DNA shuffling. 

[00181] D) MuiiiStep Small Molecule Synthesis Programmed by DNA Templates: 
Molecular evolution requires the sequence-specific translation of an amplifiable infonnation 
carrier into the structures of the evolving molecules. This requirement has limited the types of 
molecules that have been directiy evolved to two classes, proteins and nucleic acids, because 
only these classeis of molecules can be translated firom nucleic acid sequences. As described 
generally above, a promising approach to the evolution of molecules other than proteins and 
nucleic acids uses DNA-templated synthesis as a method of translating DNA sequences into 
synthetic small molecules. DNA-templated synthesis can direct a wide variety of powerful 
chemical reactions with high sequence-specificity and without requiring structural mimicry of 
the DNA backbone. The application of this approach to synthetic molecules of useful 
complexity, however, requires the development of general metiiods to enable the product of a 
DNA-templated reaction to undergo subsequent DNA-templated transformations. The first' 
DNA-templated multi-step small molecule syntheses is described in detail herein. Together with 
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recent advances in the reaction scope of DNA-templated synthesis^ these findings set the stage 
for the in vitro evolution of synthetic small molecule libraries. 

[00182] Multi-step DNA-templated small molecule synfliesis faces two major challenges 
beyond those associated with DNA-templated synthesis in general. First, the DNA used to direct 
reagents to appropriate templates must be removed from the product of a DNA-templated 
reaction prior to subsequent DNA-templated synthetic steps in order to prevent undesired 
hybridization to the template. Second, multi-step synthesis often requires the purification and 
isolation of intermediate products^ yet common methods used to purify and isolate reaction 
products are not appropriate for multi-step synthesis on the molecular biology scale. To address 
these challenges, three distinct strategies were implemented in solid-phase organic synthesis, for 
linking chemical reagents with their decoding DNA oligonucleotides and two general approaches 
for product purification after any DNA-templated synthetic step were developed. 
[00183] When possible, an ideal reagent-oligonucleotide linker for DNA-templated synthesis 
positions the oligonucleotide as a leaving group of the reagent. Under this "autocleaving" linker 
strategy, the oligonucleotide-reagent bond is cleaved as a natmal chemical consequence of the 
reaction (Fig. 33). As the first example of this approach applied to DNA-templated chemistry, ai 
dansylated Wittig phosphorane reagent (1) was synthesized in which the decoding DNA 
oligonucleotide was attached to one of the aryl phosphine groups (I. Hughes, Tetrahedron Lett. 
1996, 37, 7595). DNA-templated Wittig olefination (as described above) with aldehyde-linked 
template 2 resulted in the efiBcient transfer of the fluorescent dansyl group from the reagent to 
the template to provide olefin 3 (Fig. 33). As a second example of an autocleaving linker, DNA- 
linked thioester 4, when activated witli Ag(I) at pH 7.0 (Zhang et al 7. Am. Chem. Soa 1999, 
121, 3311) acylated amino-terminated template 5 to afford amide product 6 (Fig. 33). 
Ribosomal protein biosynthesis uses aminoacylated tRNAs in a similar autocleaving linker 
format to mediate RNA-templated peptide bond formation. To purify desired products away 
bom unreacted reagents and bom cleaved oligonucleotides following DNA-templated reactions 
using autocleaving linkers, biotinylated reagent oligonucleotides and washing crude reactions 
vdth streptavidin-linked magnetic beads (Fig. 34) were utilized. Although this approach does not 
separate reacted templates from unreacted templates, unreacted templates can be removed in 
subsequent DNA-templated reaction and purification steps (see below). 
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[00184] Reagents bearing more than one functional group can be linked to their decoding 
DNA oligonucleotides through a second and third linker strategies. In the "scarless linker" 
approach, one functional group of the reagent is reserved for DNA-templated bond formation, 
while the second functional, group is used to attach a linker that can be cleaved without 
introducing additional unwanted chemical functionality, DNA-templated reaction is followed by 
cleavage of the linker attached through the second functional group to afford desired products 
(Fig. 33). For example, a series of aminoacylation reagents such as (D)-Phe derivative 7 were 
synthesized in which the a-amine is connected through a carbamoylethylsulfone linker (Zarling 
et al J. Immunology 1980, 124, 913) to its decoding DNA oUgonucleotide. The product (8) of 
DNA-templated amide bond formation (as described herein), using this reagent and an amine- 
terminated template (5) v^as treated with aqueous base to effect the quantitative elimiriation and 
spontaneous decarboxylation of the linker, affording product 9 containing the cleanly transferred 
ammo acid group (Fig. 33). This sulfone linker is stable in pH 7.5 or lower buffer at 25 °C for 
more than 24 h yet undergoes quantitative cleavage when exposed to pH 1 1 .8 buffer for 2 h at 37 

[00185] In some cases it may be advantageous to introduce new chemical groups as a 
consequence of linker cleavage. Under a third linker strategy, linker cleavage generates a 
"useful scar" that can be fimctionalized in subsequent steps. As an example of this class of 
linker, amino acid reagents such as the (L)-Phe derivative 10 were generated linked through 1,2: 
diols (Fruchart et al Tetrahedron Lett. 1999, 40, 6225) to then decoding DNA oligonucleotides. 
Following DNA-templated amide bond formation with amine terminated template (5), this linker 
was quantitatively cleaved by oxidation with 50 mM aqueous NaI04 at pH 5.0 to afford product 
12 containing an aldehyde group appropriate for subsequent fimctionalization (for example, in a 
DNA-templated Wittig olefination, reductive amination, or nitrolaldol addition (Fig. 33). 
[00186] Desired products generated jfrom DNA-templated reactions using the scarless or 
usefiil scar linkers can be readily purified using biotinylated reagent oligonucleotides (Fig. 34). 
Reagent oUgonucleotides together with desired products are first captured on streptavidin-linked 
magnetic beads. Any unreacted template bound to reagent by base pairing is removed by 
washing the beads with buffer containing 4 M guarddinium chloride. Biotinylated molecules 
remain bound to the streptavidin beads under these conditions. Desired product is then isolated 
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in pure form by eluting the beads with linker cleavage buffer (in the examples above, either pH 
11 or NaI04'Containing buffer), while reacted and unreacted reagents remain bound to the beads. 
[001871 Integrating the recently expanded repertoire of synthetic reactions compatible with 
DNA-templated synthesis and the linker strategies described above, multi-step DNA-templated 
small molecule syntheses can be conducted. 

[00188] In one embodiment, a solution phase DNA-templated synthesis of a non-natural 
peptide library is described generally below and is shown generally in Figure 35. As shown in 
Figure 35, to generate the initial template pool for the library, thirty synthetic biotinylated 5'- 
amino oligonucleotides of the sequence format shown in Figure 35 are acylated with one of 
thirty different natural or unnatural amino acids using standard EDC coupling procedures. Font 
bases representing a ^codon" within each amino acylated primer are designated the identity of 
the side chain (Ri). The "genetic code" for these side chains are protected with acid labile 
protecting groups similar to those commonly used in peptide synthesis. The resulting thirty 
amiiio acylated DNA primers are annealed to a template DNA oligonucleotide library generated 
by automated DNA synthesis. Primer extension with a DNA polymerase followed by strand 
denaturation and purification with streptavidin-linlced magnetic beads yield the starting template 
library (see, Figure 35). As but one general example, a solution phase DNA-templated synthesis 
of a non-natural peptide library is depicted in Figure 36. The template library is subjected to 
three DNA-templated peptide bond fomiation reactions using amino acid reagents attached to 
10-base decoding DNA oligonucleotides through the sulfone linker described above. Products of 
each step are purified by preparative denaturing polyacrylamide gel electrophoresis prior to 
linker cleavage if desired, although this may not be necessary. Each DNA-linked reagent can be 
synthesized by coupling a 3 '-amino terminated DNA oligonucleotide to the encoded amino acid 
through the bis-NHS carbonate derivative of the sulfone linker as shown in Figure 37. Codons 
are again chosen so that related codons encode chemically similar amino acids. Following each 
peptide bond formation reaction, acetic anhydride is used to cap unreacted starting materials and . 
pH 1 1 buffer is used to effect linker cleavage to expose a new amino group for the next peptide 
bond formation reaction. Once the tetrapeptide is completed, those library members bearing 
caiboxylate side chains can also be cyclized with their amino termini to form macrocyclic 
peptides, while linear peptide members can simply be N-acetylated (see Figure 36); 
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[00189J It will be appreciated that a virtually unlimited assortment of amino acid building 
blocks can be incorporated into a non-natuial peptide libiaty. Unlike peptide libraries generated 
using the protein biosynthetic machinery such as phage displayed libraries (O'Neil et al Curr. 
Opiru Struct Biol 1995, J, 443-9), mRNA displayed libraries ^Roberts et al Proc, Natl Acad, 
Sci, USA 1997, P¥, 12297-12302) ribosome displayed libraries (Roberts et al Curr. Opvt Chem, 
Biol 1999, 3, 268-73; Schaf&tzel et al J. Immunol Methods 1999, 237, 1 19-35), or intracellular 
peptide libraries (Norman et al Science 1999, 285, 591-5), amino acids with non-proteinogenic 
side chains, non-natural side chain stereochemistry, or non-peptidic backbones can all be 
incorporated into this library. In addition, the many commercially available di-, tri- and 
oligopeptides can also be used as building blocks to generate longer library members. ,The 
presence of non-natural peptides in this library may confer enhanced pharmacological properties 
such as pix)tease resistance compared with peptides generated ribosomally. Similarly, the 
macrocyclic library members may yield higher affinity ligands since the entropy loss upon 
binding their targets may be less than their more flexible linear counterparts. Based on the 
enormous variety of commercially available amino acids fitting these descriptions, the maximum 
diversity of this non-natural cyclic and linear tetrapeptde library can exceed 100 x 100 x 100 x 
100 = 10* members. 

[00190] Another example of a library using the approach described above includes the DNA- 
templated synthesis of a diversity-oriented macrobicyclic library containing 5- and 14-membered 
rings (Figure 38). Starting material for this library consists of DNA templates aminoacylated 
with a variety of side-chain protected lysine derivatives and commercially available lysine 
analogs (including aminoethyl-cysteine, aminoethylserine, and 4-hydroxylysine). In the first 
step, DNA-templated amide bond formation mih a variety of DNA-Iinked amino acids takes 
place as described in the non-natural peptide library, except that the vicinal diol linker 16 
described above is used instead of the traceless sulfone linker. Following amide bond formation, 
the diol linker is oxidatively cleaved with sodium periodate. Deprotection of the lysine 
analog'side chain amine is followed by DNA-templated amide bond formation catalyzed by 
silver trifluoroacetate between the free amiiie and a library of acrylic derived thioesters. The 
resulting olefins are treated with a hydroxy lamine to form nitrones, which undergo 1,3 -dipolar 
cycloaddition to yield the bicyclic library (Figure 38). DNA-linked reagents for this library are 
prepared by coupling lysine analogs to S'-amino-terminated template primers (Figure 35), 
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coupling aminoacylated diol linkers to 3'-amino-terminated DNA oligonucleotides (Figure 38), 
and coupling acrylic acids to 3'-thiol terminated DNA oligonucleotides (Figure 38). 
[00191] As but one example of a specific library generated firom the first general approach, 
described above, three iterated cycles of DNA-templated amide formation, traceless linker 
cleavage, and purification with streptavidin-linked beads were used to generate a non-natural 
tripeptide (Fig. 39). Each amino acid reagent was linked to a unique biotinylated 10-base DNA 
oligonucleotide through the sulfone linker described above. Hie 30-base amine-tenninated 
template programmed to direct the tripeptide synthesis contained three consecutive. 10-base 
regions that were con]^lementary to the three reagents, mimicking the strategy that would be 
used in a multi-step DNA-templated small molecule Ubrary synthesis. The first amino acid 
reagent (13) was combined with the template and activator 4-(4,6-dimethoxy-l,3,5-triaan-2-yl)- 
4-methyhnorpholinium chloride (DNfT-MM) (Kunishima et al Tet^-ahedron 2001, 57, 1551) to 
effect DNA-templated peptide bond formation. The desired product was purified by mixing the 
crude reaction with streptavidin-linked magnetic beads, washing with 4 M guanidinium chloride, 
and eluting with pH 11 buffer to effect sulfone linker cleavage, providing product 14. The fi:ee 
amine group in 14 was then elaborated in a second and third round of DNA-templated amide 
formation and linker cleavage to afford dipeptide 15 and tripeptide 16 (Figure 39). 
[00192] The progress of each reaction, purification, and sulfone linker cleavage step was 
followed by denaturing polyacrylamide gel electrophoresis. The final tripeptide linked to 
template (16) was digested with the restriction endonuclease EcolRI and the digestion firagment 
containing the tripeptide was characterized by MALDI mass spectrometry. Beginning with 2 
nmol (~ 20 \ig) of starting material, sufficient tripeptide product was generated to serve as the 
template for more than 10^ in vitro selections and PGR reactions (Kramer et al. in Current 
Protocols in Molecular Biology, Vol 3 (Ed.: F. M. Ausubel), Wiley, 1999, pp. 15.1) (assuming 
1/10,000 molecules survive selection). No significant product was generated when the starting 
material template was capped with acetic anhydride, or when control reagents containing 
sequence mismatches were used instead of the complementary reagents (Fig. 39),' 
.100193] A non-peptidic multi-step DNA-templated small molecule synthesis (Fig. 40) that 
uses all three linker strategies developed above was also performed. An amine-terminated 30- 
base template was subjected to DNA-templated amide bond formation using an aminoacyl donor 
reagent (17) containing the diol linker and a biotinylated 10-base oligonucleotide to afford amide 
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18. The desired product was iisolated by capturing the crude reaction on streptavidin beads 
followed by cleaving the linker with NaI04 to generate aldehyde 19. The DNA-templated Wittig 
reaction of 19 with the biotinylated autocleaving phosphorane reagent 20 afforded flimaramide 
21. The products ftom the second DNA-templated reaction were partially purified by washing 
with streptavidin beads to remove reacted and unreacted reagent. In the third DNA-templated 
step, fumaramide 21 was subjected to a DNA-templated conjugate addition (Gartner et al. J. Am, 
CherrL Soc, 2001, 725, 6961) using thiol reagent 22 linked through the sulfone linker to a 
biotinylated oligonucleotide. The desired conjugate addition product (23) was purified by 
ixmnobiUzation with streptavidin beads. Linker cleavage with pH 11 buffer afforded final 
product 24 in S-10% overall isolated yield for the three bond forming reactions, two linker 
cleavage steps, and three purifications (Figure 40). This final product wais digested with EcoBI 
and the mass of the small molecule-linked template fragment was confirmed by MALDI mass 
spectrometry (exact mass: 2568, observed mass: 256&t5). As in the tripeptide example, each of 
the three reagents used during this multi-step synthesis aimealed at a unique location on the DNA 
template, and control reactions with sequence mismatches yielded no product (Fig. 40). As 
expected, control reactions in which the Wittig reagent was omitted (step 2) also did not generate 
product following the third step. Taken together, the DNA-templated syntheses of 16 and 24 
demonstrate the ability of DNA to direct the sequence-programmed multi-step synthesis of both 
oligomeric and non-oligomeric small molecules unrelated in structure to nucleic acids. 
[00194] The commercial availability of many substrates for DNA-templated reactions 
including amines, carboxylic acids, a-halo carbonyl compounds, olefins, alkoxyamines, 
aldehydes, and nitroalkanes may allow the translation of large libraries of DNA into diverse 
small molecule libraries. The direct one-pot selection of these libraries for members with desired 
binding or catalytic activities, followed by the PGR amplification and diversification of the DNA 
encoding active molecules, may enable synthetic small molecules to evolve in a manner 
paralleling the powerful metliods Nature uses to generate new molecular fimction. In addition, 
multi-step nucleic acid-templated synthesis is a requirement of previously proposed models (A. I. 
Scott, Tetrahedron Lett 1997, 38, 4961; Li et al Nature 1994, 369, 218; Tamura et al Proc, 
Natl Acad Sci USA 2001, P5, 1393) for the prebiotic translation of replicable information into 
fiinctional molecules. These findings demonstrate that nucleic acid templates are indeed capable 
of directing iterative or nonriterative multi-step small molecule synthesis even when reagents 
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anneal at widely varying distances from the growing molecule (in the above examples, zero to 
twenty bases). As described in more detail below, libraries of synthetic molecules can then be 
evolved towards active ligand and catalysts through cycles of translation, selection, amplification 
and mutagenesis. 

[00195] E) Evolving Plastics: In yet another embodiment of the present invention, a nucleic 
acid (e.g., DNA, RNA, derivative thereof is attached to a polymerization catalyst Since nucleic 
acids can fold into complex structures, the nucleic acid can be used to direct and/or affect the 
polymerization of a growing polymer chainv For example, the nucleic acid may influence the 
selection of monomer units to be polymerized as well as how the polymerization reaction takes 
place stereochemistry, tacticity, activity). The synthesized polymers may be selected for 
specific properties such molecular, weight, density, hydrophobicityj tacticity, sterieoselectivity, 
etc,^ and the nucleic acid which formed an integral part of the catalyst which directed its 
synthesis may be amplified and evolved (Figure 41). Iterated cycles of ligand diversification, 
selection, and amplification allow for the true evolution of catalysts and polymers towards 
desired properties. 

[00196] To give but one example, a library of DNA molecules is attached to Grubbs' 
ruthenium-based ring opening metathesis polymerization (ROMP) catalyst through a 
dihydroimidazole ligand (SchoU et al Org. Lett, 1(6):953, 1999; incorporated herein by 
reference) creating a large, diverse pool of potential catalytic molecules, each unique by nature 
of the fimctionalized ligand. Undpubtedly, fimctionali2ing the catalyst with a relatively large 
DNA-dehydroimidazole (DNA-DHI) ligand will alter the activity of the catalyst. Each DNA 
molecule has the potential to fold mto a unique stereoelectronic shape which potentially, has 
different selectivities and/or activities in the polymerization reaction (Figure 42). Therefore, the 
library of DNA ligands can be "translated" into a library of plastics upon the addition of various 
monomers. In certain embodiments, DNA-DHI ligands capable of covalently inserting 
themselves into the growing polymer, thus creating a polymer tagged with the DNA that encoded 
its creation, are used. Using the synthetic scheme shown in Figure 42, DHI ligands are produced 
containing two chemical handles, one used to attach the DNA to the ligand, the other used to 
attach a pedant olefin to the DHI backbone. Rates of metathesis are known to vary widely based 
upon olefin substitution as well as the identity of the . catalyst Through alteration of these 
variable, the rate of pendant olefin incorporation can be modulated such that A^endant olefin metathesis 
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« *ROMP, thereby, allowing polymers of moderate to high molectilar weights to be fdimed 
before insertion of the DNA tag and corresponding polymer termination. Vinylic either are 
commonly used in ROMP to fimctionalize the polymer termini (Gordon et al Chem. Biol 7:9- 
16, 2000; incorporated herein by reference), as well as produce polymers of deaeased molecular 
weight 

[00197] Subsequent selection of a polymer from the library based on a desired property by 
electrophoresis, gel filtration, centrifiigal sedimentation, partitioning into solvents of diflferent 
hydrophobicities, etc. Amplification and diversification of the coding nucleic acid via 
techniques such as error-prone PCR or DNA shuffling followed by attachment to a DHI 
backbone will allow for production of another pool of potential ROMP catalysts enriched in the 
selected activity (Figure 43). This method provides a new approach to generating polymeric 
materials and the catalysts that create them. 

[00198J Example 6; Characterization of DNA-Templated Synthetic Small Molecule 
Libraries: T he non-natural peptide and bicyclic libraries described above are characterized in 
several stages. Each candidate reagent is conjugated to its decoding DNA oligonucleotide, then 
subjected to model reactions witli matched and mismatched templates. The products from these 
reactions are analyzed by denaturing polyacrylamide gel electrophqresis to assess reaction ; 
efficiency, and by mass spectrometry to verify anticipated product structures. Once a complete 
set of robust reagents are identified, the complete multi-step DNA-templated syntheses of 
representative single library members on a large scale is perfonned and the final products are 
characterized by mass spectrometry. 

[00199] More specifically, the sequence fidelity of each multi-step DNA-templated library 
synthesis is tested by follovdng the fate of single chemically labeled reagents through the course 
of one-pot library synthesis reactions. For example, products arising from building blocks 
bearing a ketone group are captured with conmiercially available hydrazide-linked resin and 
analyzed by DNA sequencing to verify sequence fidelity during DNA-templated synthesis. 
Similarly, when using npn-biotinylated model templates, building blocks bearing biotin groups 
are purified after DNA-templated synthesis using streptavidin magnetic beads and subjected to 
DNA sequencing (Liu et al, 1 Am, Chem Soc. 2001, 123, 6961-6963) Codons that show a 
greater propensity to anneal with mismatched DNA are identified by screening in this manner 
and removed from the genetic code of these synthetic libraries. 



63 of 88 



wo 02/074929 PCTAJS02/08546 

(00200] Example 7; In Vitro Selection of Protein Ligands from Evolvable Synthetic 
Libraries: B ecause every libraiy member generated in this approach is covalently linked to a 
DNA oligonucleotide that encodes and directs its synthesis, libraries can be subjected to true in 
vitro selections. Although direct selections for small molecule catalysts of bond-forming or 
bond-cleaving reactions are an excitmg potential application of this approach, the simplest in 
vitro selection that can be used to evolve these libraries is a selection for binding to a target 
protein. An ideal initial target protein for the synthetic library selection both plays an important 
biological role and possesses known ligands of varying afSnities for validating the selection 
methods. 

[00201] One receptor of special interest for use in the present invention is the.avPs receptor. 
The avp3 receptor is a member of the mtegrin family of transmembrane heterodimeric 
glycoprotein receptors (Miller et al. Drug Discov Today 2000, 5, 397-408; Berman et al Membr 
Cell Biol. 2000, 13, 207-44) The avPs integrin receptor is expressed on the surface of many cell 
types such as osteoclasts, vascular smooth muscle cells, endothelial cells, and some tumor cells. 
This receptor mediates several important biological processes including adhesion of osteoclasts 
to the bone matrix (van der Pluijm et al J. Bone Miner. Res. 1994, 9, 1021-8) smooth muscle 
cell migration (Choi et al J. Vase. Surg. 1994, 19, 125-34) and tumor-induced angiogenesis 
(Brooks et al. Cell 1994, 79, 1157-64) (the outgrowth of new blood vessels). During tumor- 
induced angiogenesis, invasive endothelial cells bind to extracellular matrix components through 
their avPs integrin receptors. Several studies (Brooks et al Cell 1994, 79, 1157-64; Broolcs et 
al Celll99S,92, 391-400; FMandcv et al Science 1995,270, 1500-2; Vamer era/, CellAdhes 
Commun 1995, 3, 367-74; Brooks etal J. Clin Invest 1995, 96, 1815-22) have demonstrated that 
the inhibition of this integrin binding event with antibodies or small synthetic peptides induces 
apoptosis of the proliferative angiogenic vascular cells and can inhibit tumor metastasis. 
[00202] A number of peptide ligands of varying affinities and selectivities for the avPs 
integrin receptor have been reported. Two benchmark avPa integrin antagonists are the linear 
peptide GRGDSPK (IC50 = 210 nM (Dechantsreiter et al l Med. Chem. 1999, ¥2, 3033-40; 
Pfaff al J, Biol Chem. 1994, 269, 20233-8) and the cyclic peptide cyclo-RGDfV (PfaflFfi/ al 
J, Biol Chem, 1994, 269, 20233-8)* (f= (D)-Phe, IC50 = 10 nM). WhUe peptides antagonists for 
integrins commonly contain ROD, not all RGD-containing peptides are high aflBnity integrin 
ligands. Rather, the conformational context of RGD and other peptide sequences can have a 
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profound effect on integrin affinity and specificity (Wermuth et al J, Am. Chem. Soc. 1997, 119, 
1328-1335; Geyer et al 1 Am. Chem. Soc. 1994, 116, 7735-7743; Rai et al. Bioorg. Med Chem. 
Lett 2001, 7/, 1797-800; .Rai et al Cvrr. Med. Chem. 2001, 5, 101-19) For this reason, 
combinatorial approaches towards avp3 integrm receptor antagonist discovery are especially 
promising. 

(00203] The biologically important and medicinally relevant role of the avPa integrin receptor 
together vvith its known peptide antagonists and its commercial availability (Chemicon 
International, Inc., Temecula, CA) make the avPa integrin receptor an ideal initial target for 
DNA-templated synthetic small molecule libraries. The avp3 integrin receptor can be 
immobilized by adsorption onto microtiter plate wells without impairing its ligand binding 
ability or specificity (Dechantsreiter et al J. Med Chem. 1999, 42, 3033-40; Wermuth et al J. 
Am, Chem, Soc. 1997, 119, 1328-1335; Haubner et al X Am, Chem. Soc. 1996, 118, 7461-7472). 
Alternatively, the receptor can be immobilized by conjugation with NHS ester or maleimide 
groups covalently linked to sepharose beads and the ability of the resulting integrin aflBnity resin 
to maintain known ligand binding properties can be verified. 

[00204] To perform the actual protein binding selections, DNA template-linked synthetic 
peptide or macrocychc libraries are dissolved in aqueous binding buffer in one pot and 
equilibrated in the presence of immobilized avPa integrin receptor. Non-binders are washed 
away with buffer. Those molecules that may be binding through their attached DNA templates 
rather than through their syntlietic moieties are eliminated by washing the bound library with 
unfimctionalized DNA templates lacking PGR primer binding sites. Remaimng ligands bound to 
the immobilized avp3 integrin receptor are eluted by denaturation or by the addition of excess 
high affinity RGD-containing peptide ligand. The DNA templates that encode and direct the 
syntheses of avPa integrin binders are amplified by PGR using one primer designed to bind to a 
constant 3' region of the template and one pool of biotinylated primers functionalized at its 5' 
end with the library starting materials (Fig. 44). Purification of the biotinylated strand completes 
one cycle of synthetic molecule translafion, selection, and amplification, yielding a sub- 
population of DNA templates enriched in sequences that encode syntiietic avPs integrin ligands. 
[00205] For reasons similar to those that make the avPs integrin receptor an attractive initial 
target for the approach to generating synthetic molecules with desired properties, the factor Xa 
serine protease also serves as a promising protein target. Blood coagulation involves a complex 
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cascade of enzyme-catalyzed reactions that ultimately generate fibrin, the basis of blood clots 
(Rai c/ al. Curr. Med Cherru 2001, 8, 101-109; Vacca et al Curr, Opiru Chem Biol. 2000, 4, 
394-400) Thrombin is the serine protease that converts fibrinogen into fibrin during blood 
clotting. Thrombin, in turn, is generated by the proteolytic action of factor Xa on prothrombin. 
Because thromboembolitic (blood clotting) diseases such as stroke remain a leading cause of 
death m the world (Vacca et a/. Curr, Opin Ghent Biol 2000, 4, 394-400) the development of 
drugs that inhibit thrombin or factor Xa is a major area of pharmaceutical research. The 
inhibition of factor Xa is a newer approach thought to avoid the side effects associated with 
inhibiting thrombin, which is also involved in noraial hemostasis (Maignan et al J. Med. Chejjt 
2000, 43, 3226-32; Leadley et al J. Cardiovasc, Pharmacol 1999, 34, 791-9; Becker etal 
Bioorg. Med Chem Lett. 1999, P, 2753-8; Choi-Sledeski et al, Bioorg, Med Chem Lett. 1999, 
P, 2539-44; Choi-Sledeski et al J. Med Chem. 1999, 42, 3572-87; Ewmg et al J. Med Chem. 
1999, 42, 3557-71; Bostwick et al Thromb Haemost 1999, 81, 157-60). Although many agents 
including heparin, hirudin, and hirulog have been developed to control the production of 
thrombin, these agents generally have the disadvantage of requiring intravenous or subcutaneous 
injection several times a day in addition to possible side effects, and the search for synthetic 
small molecule factor Xa inhibitors remains the subject of great research effort. 
[00206] Among factor Xa inhibitors with known binding afiEmities are a series of tripeptides 
ending with arginine aldehyde (Marlowe et al Bioorg. Med Chem Lett. 2000, JO, 13-16) that 
are easily be included in the DNA-templated non-natural peptide library described above. 
Depending on the identities of the first two residues, these tripeptides exhibit IC50 values ranging 
firom 15 nM to 60 fiM (Marlowe et al Bioorg. Med Chem Lett. 2000, 10, 13-16) and therefore 
provide ideal positive controls for validating and calibrating an in vitro selection for synthetic 
factor Xa ligands (see below). Both factor Xa and active factor Xa immobilized on resin are 
conunercially available (Protehi Engineering Technologies, Denmark). The resin-bound factor 
Xa is used to select members of both the DNA-templated non-natural peptide and bicyclic 
libraries with factor Xa affinity in a manner analogous to the integrin receptor binding selections 
described above. 

[00207] Following PGR amplification of DNA templates encoding selected synthetic 
molecules, additional rounds of translation, selection, and amplification are conducted to enrich 
the library for the highest affinity binders. The stringency of the selection is gradually increased 
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by increasing the salt concentration of the binding and washing buffers, d^icreasing the duration 
of binding, elevating the binding and washing temperatures, and increasing the concentration of 
washing additives such as template DNA or unrelated proteins. Importantiy, in vitro selections 
can also select for specificity in addition to binding afiinity. To eliminate those molecules that 
possess undesired binding properties, library members bound to immobilized avPa integrin or 
factor Xa are washed with non-target proteins such as other integrins or other serine proteases, 
leaving only those molecules that bind the target protein but do not bind non-target proteins. 
[00208] Iterated cycles of translation, selection, and amplification results in library enrichment 
rather than Ubrary evolution, which requires diversification between rounds of selectioa 
Diversification of these synthetic libraries are achieved in at least two ways, both analogous to 
methods used by Nature to diversify proteins. Random point mutagenesis is performed by 
conducting the PGR amplification step under error-prone PGR (Galdwell e^ al PCR Methods 
Applic. 1992, 2, 28-33) conditions. Because the genetic code of these molecules are written to 
assign related codons to related chemical groups, similar to the way that the natural protein 
genetic code is constructed, random point mutations in the templates encoding selected 
molecules will diversify progeny towards chemically related analogs. In addition to point 
mutagenesis, synthetic libraries generated in this approach are also diversified using 
recombination. Templates to be recombined have the structure shown in Fig. 45, in which 
codons are separated by five-base non-palindromic restriction endonuclease cleavage sites such 
as those cleaved by AvaU (G/GWGG, W=A or T), Sau96i (G/GNGC, N=A, G, T, or C), Ddel 
(C/TNAG), or HiriBl (G/ANTC). Following selections, templates encoding desired molecules 
are enzymatically . digested with these commercially available restriction enzymes. The digested 
jfrj^ents are then recombined into intact templates with T4 DNA ligase. Because tiie 
restriction sites separating codons are nonpalindromic, templates firagments can only reassemble 
to form intact recombined templates (Fig. 45). DNA-templated translation of recombined 
templates provides recombined small molecules. In this way, fimctional groups between 
synthetic small molecules with desired activities are recombined in a manner analogous to the 
recombination of amino acid residues between proteias in Nature. It is well appreciated that 
recombination explores the sequence space of a molecule much more efficiently than point 
mutagenesis alone (Minshuli etal Cwr, Opin Chem Biol 1999, J, 284-90; Bogarad etal Proc. 
Natl Acad, Set USA 1999, 96, 2591-5; Stemmer, W. Nature 1994, 570, 389-391). 
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[00209] Small molecule evolution using mutation and recombination offers two potential 
advantages over simple enrichment If the total diversity of the library is much less than the 
nxunber of molecules made (typically 10*^ to 10^^, every possible library member is present at 
the start of the selection. In this case, diversification is still useM because selection conditions 
almost always change as rounds of evolution progress. For example, later rounds of selection 
will likely be conducted under higher stringencies, and may involve counter selections against 
binding nori-target proteins. Diversification gives library members that have been discarded 
during earlier rounds of selection the chance to reappear in later rounds under altered selection 
conditions in vdiich their fitness relative to other members may be greater. In addition, it is quite 
possible using this approach to generate a synthetic library that has a theoretical diversity greater 
than 10^^ molecules. In this case, diversification allows molecules that never existed in the 
original library to emerge in later rounds of selections on the basis of their similarity to selected 
molecules, similar to the way in- which protein evolution searches the vastness of protein 
sequence space one small subset at a time. 

[00210] Example 8; Characterization of Evolved Compounds: F ollowing multiple rounds 
of selection, amplification, diversification, and translation, molecules surviving the selection will 
be characterized for their ability to bind the target protein. To identify the DNA sequences, 
encodmg evolved synthetic molecules surviving the selection, PCR-amplified templates are 
cloned into vectors, transformed into cells, and sequenced as individual clones. DNA 
sequencing of these subcloned templates reveal the identity of the synthetic molecules surviving 
the selection. To gain general information about the functional groups being selected during 
rounds of evolution, populations of templates are sequenced in pools to reveal the distribution of 
A, G, T, and C at every codon position. The judicious design of each functional group's genetic 
code allows considerable information to be gathered fh)m population sequencing. For example, 
a G at tfie first position of a codon may designate a charged group, while a C at this position may 
encode a hydrophobic substituent 

[00211] To validate the integrin binding selection and to compare selected library members 
with known avp3 integrin ligands, linear GRGDSPK and a cyclic RGDfV analog (cyclic iso- 
ERGDfV) are also mcluded in the DNA-templated cyclic peptide library. The selection 
conditions are adjusted imtil verification that libraries containing these known integrin ligands 
undergo enrichment of the DNA templates encoding the known ligands upon selection for 
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integrin binding. In addition, the degree of enrichment of template sequences encoding these 
known avpa integrin ligands is conelated with their known affmities and with the enrichment 
and affinity of newly discovered avp3 integrin ligands. 

(0Q212] Once the enrichment of template sequences encoding known and new integrin ligands 
is confirmed, novel evolved ligands will be synthesized by non-DNA templated synthesis and 
assayed for their avPa integrin receptor antagonist activity and specificity. Standard in vitro 
binding assays to mtegrin recqptors (Dechantsreiter'cf al, 1 Med Chem. 1999, 42, 3033-40) are 
performed by competing the binding of biotmylated fibrinogen (a natural integrin ligand) to 
immobilized integrin receptor with the ligand to be assayed. The inhibition of blading to 
fibrinogen is quantitated by incubation with an alkaline phosphatase-conjugated anti-biotin 
antibody and a chromogenic alkaline phosphate substrate. Comparison of the binding affinities 
of randomly chosen library members before and after selection will validate the evolution of the 
library towards target binding. Assays for binding non-target proteins reveal the ability of these 
libraries to be evolved towards binding specificity in addition to binding affinity, 
[00213] Similarly, the selection for factor Xa binding is validated by iacluding the known 
factor Xa tripeptide inhibitors in the library design and verifying that a round of factor Xa 
binding selection and PGR amplification results in the enrichment of their associated DNA 
templates. Synthetic library members evolved to bind factor Xa are assayed in vitro for their 
ability to inhibit factor Xa activity. Factor Xa inhibition can be readily assayed 
spectrophotometrically using the commercially available chromogeiuc substrate S-2765 
(Chxomogenix, Italy). 

[00214] While the DNA sequence alone of a non-natural peptide library member is likely to 
reveal the exact identity of the corresponding peptide, the final step in the bicyclic library 
synthesis is a non-DNA-templated intramolecular 1,3-dipolar cycloaddition that may yield 
diastereomeric pairs of regioisomers. Although modeling strongly suggests that only the 
regioisomer shown in Fig, 38 can form for steric reasons, facial selectivity is less certain. 
Diastereomeric purity is not a requirement for the in vitro selections described above since each 
molecule is selected on a single molecule basis. Nevertheless, it may be useful to characterize 
the diastereoselectivity of the dipolar cycloaddition. To accomplish this, non-DNA-templated 
synthesis of selected bicyclic library members is performed, diastereomers are separated by 
chiral preparative HPLC, and product stereochemistry by nOe or X-ray diffiraction is determined. 
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[0021S] Example 9; Translating DNA into Non-Natural Polymers Using DNA 
Pblvmerases; An alternative approach to translating DNA into non-natural, evolyable polymers 
takes advantage of the ability of some DNA polymerases to accept certain modified nucleotide 
triphosphate substrates (D. M. Perrin et al J, Am. Ghent Soc. 2001, 123, 1556; D, M. Perrin et 
aL Nucleosides Nucleotides 1999, 18, 377-91; T. Gourlain et al Nucleic Acids Res. 2001, 2P, 
1898-1905; S. E. Lee et al Nucleic Acids Res. 2001, 2P, 1565-73; K. SaklMevel et al Angew. 
Chem Int Ed 1998, 37, 2872-2875). Several deoxyribonucleotides (Fig. 45) and ribonucleotides 
bearing modifications to groups that do not participate in Watson-Crick hydrogen bonding are 
known to be inserted with high sequence fidelity opposite natural DNA templates. Importantly, 
single-stranded DNA containing modified nucleotides can serve as efficient templates for the 
DNA-polymerase-catalyzed incorporation of natural or modified mononucleotides. In one of the 
earliest examples of modified nucleotide incorporation by DNA polymerase, Toole and co- 
workers reported the acceptance of 5-(l-pentynyl)-deoxyuridine 1 by Vent DNA polymerase 
under PGR conditions (J. A. Latham et al Nucleic Acids Res. 1994, 22, 2817-22). Several 
additional S-ftinctionalized deoxyuridines (2-7) derivatives were subsequently found to be 
accepted by thermostable DNA polymerases suitable for PGR (K. Sakthievel et al Angew, 
Chem. Int. Ed 1998, J7, 2872-2875), The first fimctionalized purine accepted by DNA 
polymerase, deoxy adenosine analog 8, was mcoiporated into DNA by T7 DNA polymerase 
together with deoxyuridine analog 7 (D. M. Perrin et al Nucleosides Nucleotides 1999, 18, 377- 
91). DNA libraries containing both 7 and 8 were successfully selected for metal-independent 
RNA cleaving activity (D. M. Perrin et al J. Am. Chem. Soc. 2001, 123, 1556-63). Williams and 
co-workers recently tested several deoxyuridine derivatives for acceptance by Tag DNA 
polymerases and concluded that acceptance is greatest when using G5-modified uridines bearing 
rigid alkyne or /rans-alkene groups such as 9 and 10 (S. E. Lee et al Nucleic Acids Res. 2001, 
29, 1565-73). A similar study (T. Gourlain et al Nucleic Acids Res, 2001, 29, 1898-1905) on 
C7-fimctionalized 7-deaza-deoxyadenosines revealed acceptance by Tag DNA polymerase of 7- 
aminopropyl- (11), cis-7-aminopropenyl- (12), and 7-aminopropynyl-7Hlea2adeoxyadenosine 
(13). 

[00216] The functionalized nucleotides incorporated by D>JA polymerases to date, shown in 
Fig. 46, have focused on adding "protein-like" acidic and ba§ic functionality to DNA. While 
equippiug nucleic acids with general acid and general base functionality such as primary amine 
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and carboxylate groups may increase the capability of nucleic acid catalysts, tlie fimctional 
groups present in natural , nucleic acid bases already have demonstrated the ability to serve as 
general acids and bases. The hepatitis delta ribozyme, for example, is thought to use the pK^r 
modulated endocyclic amine of cytosine 75 as a general acid (S. Nakano et al. Science 2000, 
257, 1493-7) and the peptidyl transferase activity of the ribosome may similarly rely on general 
base or general acid catalysis (G. W. Muth et al Science 2000, 289, 947-50; P. Nissen et al 
Science 2000, 289, 920-930; N. Ban et al Science 2000, 289, 905-920) although the latter case 
remains the subject of ongoing debate (N. Polacek et cd. Nature 2001, 411, 498-501). Equipping 
DNA bases with additional Brensted acidic and basic groups, therefore, may not profoundly 
expand the scope of DNA catalysis. 

[00217] In contrast with simple general acid and general base functionality, chiral metal 
centers would expand considerably the chemical scope of nucleic acids. Functionality aimed at 
binding chemically potent metal centers has yet to been incorporated into nucleic acid polymers. 
Natural DNA has denionstrated the ability to fold in conaplex three-dimensional structures 
capable of stereospecifically binding target molecules (C. H. Lin et al Chem. Biol. 1997, 4, 817- 
32; C. H. Lin etai Chem. Biol 1998, J, 555-72; P. Schultze etal J. Mol Biol 1994, 235, 1532- 
47) or cataly2dng phosphodiester bond manipulation (S. W, Santoro et al Proc. Natl Acad ScL 
USA 1997, 94, 4262-6; R. R. Breaker et al Chem, Biol 1995, 2, 655-60; Y. Li et al 
Biochemistry 2000, 39, 3106-14; Y. Li et al Proa Natl Acad Set USA 1999, 96, 2746-51). 
DNA depurination (T. L. Sheppard et al Proc. Natl Acad. Sci USA 2000; 97, 7802-7807) and 
porphyrin nietallation (Y. Li et al Biochemistry 1997, 36, 5589-99; Y. Li et al Nat. Stfuct Biol 
1996, 3, 743-7). Non-natural nucleic acids ai^iented with the ability to bind chemically potent, 
water-compatible metals such Cu, La, Ni, Pd, Rh, Ru, or Sc may possess greatly expanded 
catalytic properties. For example, a Pd-binding oligonucleotide folded into a well-defined 
structure may possess the ability to catalyze Pd-mediated coupling reactions with a high degree 
of regiospecificity or stereospecificity. Similarly, non-natural nucleic acids that form chiral Sc. 
binding sites may serve as enantioselective cycloaddition or aldol addition catalysts. The ability 
of DNA polymerases to translate DNA sequences into .these non-natural polymers coupled with 
in vitro selections for catalj^ic activities would therefore enable, the direct evolution of desired 
catalysts from random libraries. 
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[00218] Evolving catalysts in this approach addresses the difficulty of rationally designing 
catalytic active sites with specific chemical properties that has inspired recent combinatorial 
approaches (K. W. Kuntz et al Curr Opin. Ghent Biol 1999, 5, 313-319; M. B. Francis et al 
Curr. OpivL Chem. Biol, 1998, 2, 422-8) to organometallic catalyst discovery. For example, 
Hoveyda and co-workers identified Ti-bas.ed enantioselective epoxidation catalysts by serial 
screening of peptide ligands (K. D. Shimizu et al Angew. Chein, Int Ed 1997, 36) Serial 
screening was also used by Jacobsen and co-workers to identify peptide ligands that form 
enantioselective epoxidation catalysts when complexed with metal cations (M. B. Francis et al. 
Angew, Chem Int, Ed EngL 1999, 55, 937-941) Recently, a peptide library containing 
phosphine side chains was screened for the ability to catalyze malonate ester addition to 
cyclopentenyl acetate in the presence of Pd (S. R. Gilbertson et al J. Am, Chem, Sac, 2000j 722, 
6522-6523). The current approach differs 'fimdamentally from previous combinatorial catalyst 
discovery efforts, however, in that it enables catalysts with desired properties to spontaneously 
emerge from one pot, solution-phase libraries after evolutionary cycles of diversification, 
amplification, translation, and selection. This strategy allows up to lO'^ different catalysts to be 
generated and selected for desired properties in a single experiment. The compatibility of our 
approach with one-pot in vitro selections allows the direct selection for reaction catalysis rather 
than screening for a phenomenon associated with catalysis such as metal binding or heat 
generation. In addition, properties difficult to screen rapidly such as substrate stereospecificity 
or metal selectivity can be directly selected using our approach (see below). . 
100219] Key intermediates for a number of C5-fimctionalized uridine analogs and C7- 
functionalized 7-deazaadenosine analogs have been synthesized for incorporation into non- 
natural DNA polymers. In addition, the synthesis of six CS-fimctionalized adenosine analogs as 
deoxyribonucleotide triphosphates has been completed. Because only limited information exists 
on the ability of DNA polymerases to accept modified nucleotides, we chose to synthesize 
analogs were synthesized that not only will bring metal-binding functionality to nucleic acids but 
that also will provide insights into the determinants of DNA polymerase acceptance. 
[00220] The strategy for the synthesis of metal-binding uridine and 7-deazaadenosine analogs 
is shown in Fig, 47, Both routes end with amide bond formation between NHS esters of metal- 
binding fimctional groups and amino modified deoxyribonucleotide triphosphates (7 and 13). 
Analogs 7 and 13 as well as acetylated derivatives of 7 have been previously shown (D. M. 
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Penin et aL J. Ant Ghent Soa 2001, 123, 1556-63; D. M. Peirin et al Nucleosides Nucleotides 
1999, 18, 377-91; J, A. Latham et al Nucleic Acids Res. 1994, 22, 2817-22; T. Gourlain.er al 
Nucleic Acids Res. 2001, 29, 1898-1905; S. E. Lee et al Nucleic Acids Res, 2001, 29, 1565-73; 
K. Sakthivel et al Angew. Chem, Int. Ed Engl 1998, 57, 2872-2875) to be tQlerated by DNA 
polymerases, including thermostable DNA polymerases suitable for PGR. This convergent 
approach allows a wide variety of metal-binding ligands to be rapidly incorporated into either 
nucleotide analog. The synthesis of 7 has been completed following a previously reported (K. 
Sakthivel et al Angew. Ghent Int. Ed Engl 1998, J7, 2872-2875) route (Fig. 48, Phillips, 
Chorbaj Liu, unpublished results). Heck coupling of commercially available 5-iodo-2'- 
deoxyuridine (22) with N-allyltrifluoroacetamide provided 23. The 5'-triphosphate group was 
installed by treatment of 23 with trimethylphosphate, POCI3, and proton sponge (1,8- 
bis(dimethylamino)-naphthalene) followed by tri-n-butylammonium pyrophosphate, and the 
trifluoroacetamide group then removed with aqueous ammonia to afford 7. 
[00221] Several steps towards the synthesis of 13 have been completed, the key intermediate 
for 7-deazaadenosine analogs (Fig. 49). Following a known route (J. DavolL J, Am. Chem. Sac. 
1960, 52, 131-138) diethoxyethylcyanoacetate (24) was synthesized jfrom bromoacetal 25 and 
ethyl cyanoacetate (26). Condensation of 24 with thiourea provided pyrimidine 27, which was 
desulfiirized with Raney nickel and then cyclized to pyrrolopyrimidine 28 with dilute aqueous 
HCL Treatment of28 with POCI3 afforded 4-chloro-7-deazaadenine (29). The aryl iodide group 
which will serve as a Sonogashira coupling partner for installation of the propargylic amine in 13 
was installed by reacting 29 with N-iodosuccinimide to generate 4-chIoro-7-iodo-7-deazaadenine 
(30) in 13% overall yield from bromoacetal 25. 

[00222] As alternative fimctionalized adenine analogs that will botii probe the structural 
reqiurements of DNA polymerase acceptance and provide potential metal-binding functionality, 
six 8-modified deoxyadenosine triphosphates (Fig. 50) have been synthesized. AD functional 
groups were installed by addition to 8-bromo-deoxy adenosine (31), which was prepared by 
bromination of deoxyadenosine in the presence of ScCia, which we found to greatly increaise 
product yield. Methyl- (32), ethyl- (33), and vinyladenosme (34) were synthesized by Pd- 
mediated Stille coupling of the corresponding alkyl tin reagent and 31 (P. Mamos et al 
Tetrahedron Lett 1992, 33, 2413-2416). Methylamino- (35) (E. Nandanan et al. J. Med Ghem. 
1999, 42, 1625-1638), ethylamino- (36), and histaminoadenosine (37) were prepared by 
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treatment of 23 with the coirespondmg amine in water or ethanoL The 5'-nucleotide 
triphosphates of 32-37 were synthesized as described above. 

[00223] The ability of thermostable DNA polymerases suitable for PGR amplification to 
accept these modified nucleotide triphosphates containing metal-binding functionaUty. Non- 
natural nucleotide triphosphates were purified by ion exchange HPLC and added to PGR 
reactions containing Tag DNA polymerase, three natural deoxynucleotide triphosphates, pUG19 
template DNA, and two DNA primers. Primers were chosen to generate PGR products rangmg 
from 50 to 200 base pairs in length. Control PGR reactions contained the four natural 
deoxynucleotide triphosphates and no non-natural nucleotides. PGR reactions were analyzed by 
agarose or denaturing acrylamide gel electrophoresis. Amino modified uridine derivative 7 was 
efiScientiy incoijporated by Tag DNA polymerase over 30 PGR cycles, while the triphosphate of 
23 was not an cflficient polymerase substrate (Fig. 51). Previous findings on the acceptance of 7 
by Tag DNA polymerase are in conflict, witii botii non-acceptance (K. Sakthivel et al Angew. 
Chem. Int, Ed Engl 1998, 37, 2872-2875) and acceptance (S. E. Lee et al Nucleic Acids Res, 
2001, 2P, 1565-73) reported. 

[00224] Non-Natural Metal-Binding Deoxyribonucleotide Triphosphate Syntliesis: The 
syntheses of the G5-fimctionalized uridine, G7-functionalized 7-deazaadenosine, and G8- 
fimctionalized adenosme deoxynucleotide triphosphates will be completed. Synfliesis of the 7- 
deazaadenosine derivatives from 4-chloro-7-iodo-deazaadenine (30) proceeds by glycosylatioii 
of 30 with protected deoxyribosyl chloride 38 followed by ammonolysis to afford 7-iodo- 
adenosine (39) (Fig. 31) (Gourlain et al. Nucleic Acids Res. 2001. 29, 1898-1905). Protected 
deoxyribosyl chloride 38 can be generated &om deoxyribose as shown in Fig. 52. Pd-mediated 
Sonogashira coupling (Seela et al Helv. Chem, Acta 1999, 52, 1878-1898) of 39 with N- 
propynyltrifluoroacetamide provides 40, wUch is then be converted to the 5' nucleotide 
triphosphate and deprotected with ammonia as described above to yield 13. 
[00225] To generate rapidly a collection of metal-binding uridine and adenosine analogs, a 
variety of metal-binding groups as NHS esters will be coupled to G5-modified uridme 
intermediate 7 (already synthesized) and C7-modified 7-deazaadenosine intermediate 13. Metal- 
binding groups that will be examined initially are shown in Fig. 47 and include phosphines, 
thiopyridyl groups, and hemi-salen moieties. If our initial polymerase acceptance assays (see the 
following section) of triphosphates of 8-modified adenosines 32-37 (Fig. 50) suggest that a 
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variety of 8-modij5ed adenosine analogs are accepted by thermostable polymerases, alkyl- and 
vinyl trifluoroacetamides will be coupled to 8-bromo-deoxyadenosine (31) to generate nucleotide 
triphosphates such as 41 and 42 (Fig. 53). These intermediates are then coupled with the NHS 
esters shown in Fig. 46 to generate a variety of metal-binding 8-functionalized deoxyadenosine 
triphosphates. 

[00226] Evaluating Non-Natural Nucleotides: Each fimctionalized deoxyribonucleotide 
triphosphate is then assayed for its suitability as a building block of an evolvable non-natural 
polymer library in two stages. First, simple acceptance by thermostable DNA polymerases is 
measured by PGR amplification of fragments of DNA plasmid pUC19 of varying length. PGR . 
reactions contain synthetic primers designed to bind at the ends of the fragment, a small quantity 
of pUC19 template DNA, a thermostable DNA polymerase {^aq, Pfa or Vent), three natural 
deoxyribonucleotide triphosphates, and the non-natural nucleotide triphosphate to be tested. The 
completely successfid incorporation of the non-natural nucleotide results in the production of 
DNA products of any length at a rate similar to that of the control reaction. Those nucleotides 
that allow at least incorporation of 10 or more non-natural nucleotides in a single product 
molecule with at least modest efficiency are subjected to the second stage of evaluation. 
[00227J Non-natural nucleotides accepted by thermostable DNA polymerases are evaluated 
for their possible mutagenic properties. If DNA polymerases insert a non-natural nucleotide 
opposite an incorrect (non-Watson-Crick) template base, or insert an incorrect natural nucleotide 
opposite a non-natural nucleotide in the template, the fidelity of library amplification and 
translation is compromised. To evaluate this possibility, PGR products generated in the above 
assay are subjected to DNA sequencmg using each of the PGR primers. Deviations from the 
sequence of the pUG19 template imply that one or both of the mutagenic mechanisms are taking 
place. Error rates of less than 0.7% per base per 30 PGR cycles are acceptable, as error-proiie 
PGR generates errors at approximately this rate (Caldwell et al. PCT Methods Applic. 1992, 2, 
28-33) yet has been successfully used to evolve nucleic acid libraries. 

[0022S] Pairs of promising non-natural adenosine analogs and non-natural uridine analogs are 
also tested together for their ability to support DNA polymerization in a PGR reaction contaming 
both modified nucleotide triphosphates together with dGTP and dCTP. Successful PGR product 
formation with two non-natural nucleotide triphosphates enables the iiicorporation of two non- 
natural metal-binding bases into the same polymer molecule. Functionalized nucleotides that are 
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especially interesting yet are not compatible with Taq, Pju, or Vent thennostable DNA 
polymerases can still be used in the libraries provided that they are accepted by a commercially 
available DNA polymerase such as the Klenow ftagment of £. coli DNA polymerase I, T7 DNA 
polymerase, T4 DNA polymerase, or M-MiiLV reverse transariptase. In this case, the assays 
require conducting the prinier extension step of the PGR reaction at 25-37°C, and fresh 
polymerase must be added at every cycle following the 94°C denaturation step. DNA 
sequencing to evaluate the possible mutagenic properties of the non-natural nucleotide is still 
performed as described above 

[00229] Generating Libraries of Metal-Binding Polymers: Based on the results of the above 
non-natural nucleotide assays, several libraries of ^-10^^ different nucleic acid sequences will be 
made containing one or two of the most polymerase compatible and chemically promising non- 
natural metal-binding nucleotides. Libraries are generated by PGR amplification of a synthetic 
DNA template library consisting of a random region of 20 or 40 nucleotides flanked by two 15- 
base constant priming regions (Fig. 54). The priming regions contain restriction endonuclease 
cleavage sites to allow cloning into vectors for DNA sequencing of pools or mdividual library 
members. One primer contains a chemical handle such as a primary amine group or a thiol, 
group at its 5' terminus and becomes the coding strand of the library. The other primer contains 
a biotinylated T at its 5' terminus and becomes the non-coding strand. The PGR reaction 
includes one or two non-natural metal-bindmg deoxyribonucleotide triphosphates, three or two 
natural deoxyribonucleotide triphosphates, and a DNA polymerase compatible with the non- 
natural nucleotide(s). Following PGR reaction to generate the double-stranded form of the 
library and gel purification to remove unused primers, library member duplexes are denatured 
chemically. The non-coding strands are the removed by several washings with streptavidin- 
linked magnetic beads to ensure that no biotinylated strands remain in the library. Libraries of 
up to 10^^ different members are generated by this method, far exceeding the combined diversity 
of previous combinatorial catalyst efforts. 

[00230] Each library is then incubated in aqueous solution with a metal of interest fix)m the 
following non-limiting list of water compatible metal salts (Fringueli et al Eur, J. Org. Chem, 
2001, 2007, 439-455; Zaitoun et al J. Phys, Chem, B 1997, 1857-1860): ScCU, GrGb, MnGla, 
FeGl2, FeGla, C0CI2, NiClz, GuGb, ZnCh, GaCb, YCI3, RuCb, RhCia, PdGb, AgGl, GdGh, 
InGb, SnCl2, La(0Tf)3, Ce(0Tf)3, Pr(0Tf)3, Nd(0Ti)3, Sm(0Tf)3, Eu(0Tf)3, Gd(0Tf)3, 
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Tb(0Tf)3, Dy(0Tf)3, Ho(OTf)3,. Er(0Tf)3, Tiii(0TQ3, Yb(0Tf)3, Lu(0Tf)3, IrCb, PtCb, 
AuCl, HgCh, HgCl, PbCli, or BiCb. Metals are chosen based on the specific chemical reactions 
to be catalyzed For example, libraries aimed at reactions such as aldol condensations or hetero 
Diels-Alder reactions that are known (Fringuelli et al Eur. J, Org. Cham. 2001, 2001, 439-455) 
to be catalyzed by Lewis acids are incubated with ScCla or with one of the lanthanide triflates, 
^le those aimed at coupling electron-deficient olefins with aiyl halides are incubated with 
PdCh. The metalated library is then purified away fix)m unbound metal salts by gel filtration 
using sephadex or acrylamide cartridges, which separate DNA oligonucleotides 25 bases or 
longer from unbound small molecule components. 

[00231] The ability of the polymer library (or of individual library meihbers) to bind metals of 
interest is verified by treating the metalated library firee of unbound metals with metal staining 
reagents such as dithiooxamide, dimethylglyoxime, KSCN (Francis et al. Curr. Opitt Chem. 
Biol 1998, 2, 422-8) or EDTA (Zaitoun et al J. Phys. Chem. B 1997, iOi, 1857-1860) that 
become distinctiy colored in the presence of different metals. The approximate level of metal 
binding is measured by spectrophotometric comparison with solutiops of firee metals of known 
concentration and with solutions of positive control oligonucleotides containing an EDTA group 
(which can be introduced using a conunercially available phosphoramidite from Glen Research). 
[00232] In Vitro Selections for Non-Natural Polymer Catalysts: Metalated libraries of 
eyolvable non-natural polymers containing metal-binding groups are then subjected to one-pot, 
solution-phase selections for catalytic activities of interest. Library members that catalyze 
virtually any reaction that causes bond formation between two substrate molecules or that results 
in bond breakage into two product molecules are selected using the schemes proposed in Figs. 54 
and 55. To select for bond forming catalysts (for example, hetero Diels-Alder, Heck coupling, 
aldol reaction, or olefin metathesis catalysts), library members are covalenfly linked to one 
substrate through their 5' amino or thiol termini. The other substrate of the reaction is 
synthesized as a derivative linked to biotin. When dilute solutions of library-substrate conjugate 
are reacted with the substrate-biotin conjugate, those library members tiiat catalyze bond 
formation cause the biotin groi^ to become covalentiy attached to themselves. Active bond 
forming catalysts can then be separated fi:om inactive library members by capturing the former 
with inmaobilized streptayidin and washing away inactive polymers (Fig. 55). 
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[00233] In an analogous manner, library members that catalyze bond cleav^e reactions such 
as retro-aldol reactions, amide hydrolysis, elimination reactions, or olefin dihydroxylation 
followed by periodate cleavage can also be selected. In this case, metalated library members are 
covalently linked to biotinylated substrates such that the bond breakage reaction causes the 
disconnection of the biotin moiety from the library members (Fig. 56). Upon incubation under 
reaction conditions, active catalysts, but not inactive libraiy members, induce the loss of their 
biotin groups. Streptavidin-linked beads can then be used to capture inactive polymers, while 
active catalysts are able to elute from the beads. Related bond jformation and bond cleavage 
selections have been used successfully in catalytic RNA and DNA evolution (Jaschke et al Curr. 
Opin. Chem, Biol 2000, 4, 257-62) Although these selections do not explicitiy select for 
multiple turnover catalysis, RNAs and DNAs selected in this manner have in general proven to 
be multiple turnover catalysts when separated from their substrate moieties (Jaschke et al Curr. 
Opin Chem. Biol 2000, 4, ISl'Sl) Jaeger et al Proc. Natl Acad Sci, USA 1999, P(J, 14712-7; 
Bartel et al Science, 1993, 261, 141 1-8; Sen et al Cum Opin. Chem. Biol 1998, 2, 680-7). 
[00234] Catalysts of three important and diverse bond-forming reactions will initially be 
evolved: Heck coupling, hetero Diels-Alder cycloaddition, and aldol addition. All three 
reactions are water compatible (Kobayashi et al J. Am. Chem. Soc. 1998, 120, 8287-8288; 
Fringuelli etal Eur X Org. Chem. 2001, 2001, 439-455; Li etal Organic Reactions in Aqueous 
Media: Wiley and Sons: New York, 1997) and are known to be catalyzed by metals. As Heck 
coupling substrates both electron deficient and unactivated olefins will be used together with aryl 
iodides and aiyl chlorides. Heck reactions with aryl chlorides in aqueous solution, as well as 
room temperature Heck reactions with non-activiated aryl chlorides, have not yet been reported to 
oxir knowledge. Libraries for Heck coupling catalyst evolution use PdCh as a metal source, 
Hetero Diels-Alder substrates include simple dienes and aldehydes, v/hile aldol addition 
substrates consist of aldehydes and both silyl enol ethers as well as simple ketones. 
Representative selection schenaes for Heck coupling, hetero Diels-Alder, and aldol addition 
catalysts are shown in Fig. 57. The stringency of these selections can be increased between 
rounds of selection by decreasing reaction times, lowering reaction temperatures, or using less 
activated substrates (for example, less electron poor aryl chlorides (Littke et al J. Am, Chem, . 
Soc. 2001, 725, 6989-7000) or shnple ketones instead of silyl enol ethers). 
[00235] Evolving Non-Natural Polymers: Diversification and Selecting for Stereospeciflcity 
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[00236] Following each round of selection, active library members are amplified by PGR with 
the non-natural nucleotides and subjected to additional rounds of selection to enrich the library 
for desired catalysts. These libraries are truly evolved by introducing a diversification step 
before each round of selection. Libraries are diversified by random mutagenesis using error- 
prone PGR (Caldwell et al. PCR Methods Applic 1992, 2, 28-33) or by recombination using 
modified DNA shufQing methods that recombine small, non-homologoxis nucleic acid firagments. 
Because error-prone PCR is inherentiy less efficient than normal PCR, error-prone PCR 
diversification will be conducted with only natural dATP, dTTP, dCTP, and dGTP and using 
primers that lack chemical handles or biotin groups, The resulting mutagenized products are 
then subjected to PGR translation into non-natural nucleic acid polymers using standard PGR 
reactions containing the non-natural nucleotide(s), the biotinylated primer, and the amino- or 
thiol-terminated primer. 

[00237] In addition to simjply evolving active catalysts, the in vitro selections described above 
are used to evolve non-natural polymer libraries in powerful directions difficult to achieve using 
other catalyst discovery approaches. An enabling feature of these selections is the ability to 
select either for library members that are biotinylated or for members that are not biotinylated. 
Substrate specificity among catalysts can therefore be evolved by selecting for active catalysts in 
the presence of the desired substrate and then selecting in the same pot for inactive catalysts in 
the presence of one or more undesired substrates. If the desired and undesired substrates differ 
by the configuration at one or more stereocenters, enantioselective or diastereoselective catalysts 
can emerge firom rounds of selection. Similarly, metal selectivity can be evolved by selecting for 
active catalysts in the presence of desired metals and selecting for inactive catalysts in the 
presence of undesired metals. Conversely, catalysts with broad substrate tolerance can be 
evolved by varying substrate structures between successive rounds of selection. 
[00238] Finally, the observatioiis of sequence-specific DNA-templated synthesis in DMF and 
CH2CI2 suggests that DNA-tetralkylammonium cation complexes can form base-paired 
structures in organic solvents. TTiis finding raises the possibility of evolving our non-natural 
nucleic acid catalysts in organic solvents usuig slightiy modifi^ed versions of the selections 
described above. The actual bond forming and. bond cleiayage selection reactions will be 
conducted in organic solvents, the crude reactions will be etiianol precipitated to remove the 
tetraaikylammonium cations, and' the inmiobilized ayidin separation of biotinylated and non- 
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biotinylated library members in aqueous solution will be performed. PGR amplification of 
selected members will then take place as described above. The successful evolution of reaction 
catalysts that function in organic solvents would expand considerably both the scope of reactions 
that can be catalyzed and the utility of the resulting evolved non-natural polymer catalysts. 
[00239] Characterizing Evolved Non-Natural Polymers: Libraries subjected to several 
rovmds of evolution are characterized for their abiUty to catalyze the reactions of interest both as 
pools of mixed sequences or as individual library members. Individual members are extricated 
from evolved pools by ligating PGR ampUfied sequences into DNA vectors, transforming dilute 
solutions of ligated vectors into competent bacterial cells, and picking single colonies of 
transfoimants. Assays on pools or individual sequences are conducted both in the single 
turnover format and in a true multiple tumover catalytic format. For the single turnover assays, 
the rate at which substrate-linked bond| formation catalysts effect thek own biotinylation in the 
presence of free biotinylated substrate will be measured, or the rate at which biotinylated bond 
breakage catalysts effect the loss of their biotin groups. Multiple turnover assays are conducted 
by incubating evolved catalysts with small molecule versions of substrates and analyzing the rate 
of product formation by tic, NMR, mass spectrometry, HPLC, or spectrophotometry. 
[00240] Once multiple tumover batalysts.are evolved and verified by these methods, detailed 
mechanistic studies can be conducted on the catalysts. The DNA sequences conresponding to the 
catalysts are revealed by sequencing PGR products or DNA vectors containing the templates of 
active catalysts. Metal preferences are evaluated by metalating catalysts with a wide variety of 
metal cations and measuring the resulting changes in activity. The substrate specificity and 
stereoselectivity of these catalysts are assessed by measuring the rates of tumover of a series of 
substrate analogs. Diastereoselectivities and enantioselectivities of product formation are 
revealed by comparing reaction products with those of known stereochemistry. Previous studies 
suggest that active sites buried within large chiral. environments often possess high degrees of 
stereoselectivity. For example, peptidc-based catalysts generated in combinatorial approaches 
have demonstrated poor to excelleiit stereoselectivities that correlate with the size of the peptide 
Ugand (Jarvo et aL J. Am. Chem Sac, 1999, 121, 1 1638-1 1643) while RNA-based catalysts and 
antibody-based catalysts frequently demonstrate excellent stereoselectivities (Jiscbke et aL Curr, 
Opin, Chem. Biol 2000, 4, 257-262; SeeUg et aL Angew. Chem. Int. Ed Engl 2000, 39, 4576- 
4579; Hilvert, D. Annu. Rev. Biochem. 2000, tfP, 751-93; Barbas et aL Science 1997, 275, 2085- 
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92; Zhong et al Angew, Chem. Int. Ed Engl 1999, 38, 3738-3741; Zhong et al J, Am, Chem, 
Soc, 1997, 119, 8131-8132; List et ai Org, Lett. 1999, 7, 59-61) The direct selections for 
substrate stereoselectivity described above should fiirther enhance this property among evolved 
catalysts. 

[00241] Structure-function studies on evolved catalysts are greatly facilitated by the ease of 
automated DNA synthesis. Site-specific stmctural modifications are introduced by synthesizing 
DNA sequences corresponding to "mutated" catalysts in which bases of interest are changed to 
other bases. Changing the non-natural bases in a catalyst to a natural base (U* to C or A* to G) 
and assaying the resulting mutants may identify the chemically important metal-binding sites in 
each catalyst. The minimal pol)aner required for efScient catalysis are determined by 
synthesizing and assaying progressively truncated versions of active catalysts. Finally, the three- 
dimensional structures of the most interesting evolved catalysts complexed with metals are 
solved in collaboration with local . macromolecular NMR spectroscopists or X-ray 
crystallographers. 
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Claims 

1 . A method of syntheiszing one or more chemical compounds, the method comprising the 
steps of: 

providing one or more templates, which one or more templates optionally have a reactive 
unit associated therewith; 

contactmg 6ne or more transfer units having an iEinti-codon and reactive unit with said one 
or more templates mider conditions to allow for hybridization of the one or more anti-codons to 
the template, and reaction of the reactive units. 

2. The method of claim 1, wherein a portion of the one or more chemical compounds 
contain an anti-cpdon comprising a nucleotide sequence which hybridizes with one or more 
nucleic acid templates. 

3. The method of claim 1, wherein a library of more than one compound is synthesized, the 
method further comprising: 

probing (he library to detect a library member comprising a reaction product of the 
reactive units displaying a desired property; 

detecting structural information about the reaction product; and 

iterating the method with one or more additional species to produce a new generation of 
reaction products, at least some of which display the desired property. 

3. The method of claim 1, wherein the chemical compound syntheiszed is a compound other 
than a nucleic acid or nucleic acid analog. 

4. The method of claim 1 , wherein the chemical compound is an xmnatural polymer. 

5. The method of claim 1, wherein the chemical compound is a small molecule. 
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6. The method of claim 1 , wherein the chemical compound is a small molecule and the step 
of contacting comprisies contacting with two or more transfer units in a sequential manner. 

7. The method of claim 1 , wherein the template is a nucleic acid template. 

8. The method of claim 1, wherein the template comprises DNA or RNA. 

9. . The method of clainll,\y4iereinthe template comprises DNA. 

10. A library comprising one or more chemical compound? wherem each of the chemical 
compounds is bonded to an amplifiable template whose nucleotide sequence is informative of the 
structure of the chemical compounds. 

11. The hbrary of claim 1 0, wherem the library of chemical compounds comprises a library 
of small molecules. 

12. A method for the synthesis of a library of chemical compounds, the method comprising 
the steps of: 

providing one.or more templates optionally associated with one or more reactive units; 

contacting the one or more templates simultaneously or sequentially with one or more 
transfer units comprising anti-codon units associated with one or more reactive units under 
condition suitable for hybridization of the anti-codons with the template and ieaction of the 
reactive units to produce a plurality of different library members. 

13. The method of claim 12, wherein a portion of the compounds contain an anti-codon 
comprising a nucleotide sequence wMch hybridizes with one or more nucleic acid templates. 

14. Themethodof claim 12, the method further comprising: 

probing tlie library to detect a library member comprising a reaction product of the 
reactive units displaying a desired property; 

detecting structural information about the reaction product; and 
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iterating the method with one or more additional species to produce a new generation of 
reaction products, at least.some of which display the desired property. 

15. The method of clahn 14, wherein the desired property comprises binding to a target 
protein. 

16. The method of claim 14, wherein the desired property comprises catalyzing a chemical 
reactiorL 

17. The method of claim 14, fiirther comprising the step of isolating the one or chemical 
compounds. 

18. The method of claim 14, wherein the chemical compounds synthesized are small 
molecules. 

19: A method for the synthesis of one or more unnatural polymers, the method comprising 
' steps of: 

providing one or more nucleic acid templates; . 

contacting one or more transfer units with the one or more nucleic acid template under 
conditions to allow for hybridizaUon and reaction to form bonds between adjacent monomer 
units lined up along the template. 

20. The method of claim 1 9, wherein the method ifurther comprises the step of isolating the 
unnatural polymer. 

21 . The method of claim 1 9, wherein a portion of the unnatural polymers contain an anti- 
codon comprising a nucleotide sequence which hybridizes with one or more nucleic acid 
templates. 

22. The method ofclaim 19, the method furiercomprism^^ 
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probing the libraiy to detect a library member comprising a reaction product of the 
reactive units displaying a desired property; 

detecting structural information about the reaction product; and 

iteratmg the method with one or more additional species to produce a new generation of 
reaction products, at least some of which display flie desired property. 

24. The method of claim 23, whereiri the desired property comprises catalyzing a chemical 
reaction. 

25. The method of claim 12 or 19, wherein the nucleic acid template is DNA. 

26. The method of claim 12 or 19, v^erein the nucleic acid template is single-stranded 

27. The method of claim 12 or 19, wherein the nucleic acid template is double-stranded. 

28. The method of claim 12 or 19, wherein the nucleic acid template is selected from the 
group consisting of DNA, RNA, a hybrid of DNA and RNA, a derivative of DNA, and a 
derivative of RN A. 

29. The method of claim 12 or 19, wherein the anti-codons comprise 2, 3, 4, 5, 6, 7, 8, 9, or 
10 bases. 

30. The method of claim 12 or 19, wherein the anti-codon does not allow frame-shifting. 

31. The method of claim 12 or 19, wherein the anti-codon is selected from the group 
consistmg of DNA, RNA, a hybrid of DNA and RNA, a derivative of DNA, and a derivative of 
RNA. 

32. The method of claim 12 or 19, wherein the anticpdon is associated witii the monomer xmit 
through a covalent bond. 
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33 . A method of evolving a library of compounds, the method comprising steps of: 
isolating a compound with a desired activity attached to a nucleic acid template that 

encoded its synthesis; 

mutating and amplifying the nucleic acid template; and 

synthesizing a new library of related compounds using a mutated and ampUfied nucleic 
acid template. 

34. The method of claim 33 further comprising steps of: - 
assaying compounds for desired activity; and 

repeating stdps in claim 33 based on template of polymer with desired activity. 

35. The method of claim 34 wherein the desired activity is selected fiom the groups 
consisting of catalytic activity and binding activity. 

36. The method of claim 34 wherein the step of mutating and amplifying is performed using 
the polymerase chain reaction. 

37. The method of claim 34 wherein the step of mutating and amplifying is performed using 
eiror-prone PGR, 

38. The method of claim 34 wherein the step of mutating and amplifying is performed using 
i/i vz/ro homologous recombination (DNA shuffling). 

39. A library of chemical compounds comprising one or mote chemical compoimds wherein 
each of the chemical compounds is bonded to an amplifiable template whose nucleotide 
sequence is informative of the structure of the chemical compounds, and wherein the library is 
synthesized according to the method of claim 1, 12 or 19. 

40. . The library of claim 39, wherein the library is a library of small molecules or unnatural 
polymers. 
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41 . A kit comprising one or more nucleic acid templates and one or more transfer units. 

42. The kit of claim 41 fiatiier comprising buffers, heat-stable DNA polymerase, nucleotides, 
and restriction endonucleases. 

43. The kit of claim 41 wherein flie nucleic acid template is associated with a small molecule 
scaffold. 

44. The kit of claim 41 comprising a plurality of nucleic acid templates. 

45. The kit of claim 41 wherein the transfer units comprise monomer units. 

46. The kit of claim 4 1 wherein the transfer units comprise reactants to be used in modifying 
a small molecule scaffold. 
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candidate molecules 

natural products, 
. synthetic molecules, 
proieUus, nucleic acids 




.molecules with 
desired binding or 
catalytic properties 



ott^tlficttdtfii (and 
hfintutian 



htfonmtion encoding 
molecules with desired 
properties 




itrnctitre 
delenninatioa w 
U{ dccodins, D^iA 
, ttittttuattji 



structure-activity 
relationships 
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( 



....c-z— z-c-c— z-z-c-c— z— Z-C-C— 2-Z-C 
X=A,T,orC. ! : : ! : i ! i i i i I ! : : : 
Z=T.A.orG 6-X-^i-<S 6-)^-X-<S <f— X-X-G G-X-X-G 

...-C— z— z-c— z— z-c— z— z-c-z— z-c— z-z-- 

X=A,T.orC : : : I 5 i i i i : i i i i : 
Z=^.A.-otG ^j^^ e-X-X G-X-X G-X-X G-X-X 
Ah, HO,i Ah, HO,i Ah, ho^ Ah, ho,c nh, ho/: 

4 ffModmpter a«f inp/rf non-frameshifting codon set. Eachprovides 9 possible codons. 
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•^i^^ biotin "« I aviciin — substrate 



biotin;temiinated 
bio^lymer 




substrate 2 
biotin iiiiavldin -substrate 1 



bond-cleavage ■ 

catalysis ^ ^JC^jt^^ 



'^VipT biotin »" awdin - — product 




bond-fonnatioa 
catalysis 




biotin "» avidin — product 
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te«.pIaUnEHEH--E6HMEEHH EEHHEE HH 

♦ ofraagent ^„no0303030303 03 0 303 0 3 
mismatches: ^-^^ ^ ^ 



' ' TS*; ' Smln lOinln Mmh 



template: HHHHHHEE EEEEEEE EE E 
nuolaophlle: SSSSSSSSSSSSSSNNNN 

MXMXMXMX.MXMXMXMXMX 

^, — — — , . 




o SIAB 8BAP SA SMCC QM8S 8MPS SvisB 5MCC SVSS 

9 9 _ ** SMCC BMPS «^ 



SMCC 

O, 



0 Jmtm^' 



fTgui?e (p3 
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ikmBt2SX! 



tomplbrteh: 



Figure 7^ 



45 "C 
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0 20 40 60 80 100 120 



Figure 8 
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backbone 

iofntonatfihes: 0 3 0 3 0 3 0 3 0 3 0 3 



0 3 



_ ^,j^<^gS» &, 

backbone: (DMA), '^S <«>. <«^' <^ 
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U^^^V ^ n bases ' „ „ , ^ , 

o (reageny » pempiate} 

100 




— -n-1,12JnM 



time (mln) 

-»-n10,1tSnM 
>— n-10,ZSnM 

1,125 IHM 



Figure 10 



-^cn(s,25nM 
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0 



templats-dlrectsd 
translaUonofONA 
IHxBrylrdo synthetic 



of 1,034 tonptato 

blotin 



1«02S total 
. startirm 
matsriab 



1,02SlotaI 



o 



1.025 
I presumed 

, . I products of 

" I 1,030,625 

CNONiNGNNc I theoretical 

3^ pnxHids 



1) in vitro saiaction with 
straptavidin baada 

2) PCRamp)iflcat]an 
of saleeted produds 

9 '~TGCTCc a w xcc ca ;??TTT??? c*TfceM<CTBee MtTi ; ^ ' 
DMA erraodina Mledad and amptified niolacules 

characterize tjyONA 
sequendng and digestion 



unitary product 

Figure 11 
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g condtUons product V^e'^l (%) 



1 



70 



11 10 



14 g . 13 

c 



H 53 (R » Me) 
42(RaBn) 



54 



18 o'^o 15* 




12 




16 



17 




17 19 
18 19 
20 ^ 19 . ' 
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reaction: 1 + 3 4+5 10 + 11 11 + 13 12 + 15 18 + 19 
matchedness: M X M X M X M X M X M X 




14/57 



wo 02/074929 



PCT/US02/08546 



DNA-templated 
10 bases ^ amide bond 

B product }re\dm 

J) 79.59 



4 6 



4 



4 9 °^ 



5 ° 6 



5 



o. V 



81,62 



58, 66 



.47,64 



58.53 



66.71 
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(a) 




C 

o 

TO 

E 

P o 
o 

<D 

■4-" 

2 



formation *\ 




/7 bases 



rate of 

template-reagent 
annealing 



distance 
Independent 
regime 



n 



(b) 



product — >^ 
template — ► 



/i=0 /i=0 /i=0 /i=5f0 n=fO n=fO 



time (min): 5 60 780 5 60 780 



16/57 



wo 02/074929 



PCTAJS02/08546 




17/57 



wo 02/074929 



PCT/US02/08546 




18/57 



wo 02/074929. 



PCTAJS02/08546 



autocfeaved tlnkBf scatiosa ttnker useful scsu Ifnker 
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DCC 
reageni— 



La 



X lenptate . / \ ^^^^^ 

AgC02CF3 



reagent -ST -template R-^NIf^^tanpIale 



OH **-^SH 
^ autocleaved linker 



R'^-'^^^tefnptate 
useful scar linker 
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^N^S-R 



template: + none + + none + + | 



reagent matchedness: nam X X 
ptBannaated In water?: . ^ » H 

product — ► 



M M M 
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Start Monoincr 



o 



ChNBOC 




. covalendy-inlact template DMA strand 

C "»«'"""Q^ . starting string of dCs provides 
good InitiatiQn and a site lor 
PGR pdmlng with dtgo-dG 



HO 



Photocaged prtrraiy amine 
pre^/ents premature 
Initiation of cart>amate 
poiyn^^:ation 



Me \ 



\ 



TBSQ 




8,utiiiinitB|* 



Every 2 nudeottdes 
encodes one dicarbafnate 
"mononier": this provides 
14 functional codons, 
1 start codon,1 stop 
codon 



Extend Monomers 




0 yj . 



^\sstO Me ] 





B4 imitituj 0^' 



Stop Monomer 



biopblymer a 
polymerization ^ 
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dC}nx covatently-lntact DMA double helix 
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0 



SteitHaiipin:covalcnlly| cjiinniQ— (^^ >\^d- p^^ ^ deaves 
Untedtotemplate 1 . | exdusively here • 



Atefl does not deave jQ 
phosphorofliloata ^'^x / \ o 

Qllltlllllc— C i (I ' 



0-. >^<'7-o 



encoding DNA suitable for PGR, \ /~" '^v Extension Mononien 
sequendng. etc . ^ ^ I : 



3' OHUHuniuiiinititii GGGCCX„GGC-5* 



+ PNA • °' 



digest wilhA/cfl ^* *^ 



BsiinHia,*' 

3»0H»««i«««"M""«-vvNA/SA.vvvt|otIn avldln purify ^ / ? [Stop Monomer 



* \ < /" Li 

encoding DNA sidechain-twaring Vs 



Components of an amplifiable, evolvable jimctionalized peptide nucleic acid library. 
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DNA* 



""i— u — u — u» — 

'C^--C^G— T— C— T-- A .- A A-G— A— 6--6--fc--(5— hairpin 



solvent 



coupling reagent, 
leaving groap 




PNA q 



Oregon Greetvr—^ ^^]_ 

^T«T-T-C V— G-C--G-C--G-^y 

C— G— T— C— T— A— A-«A-G— /U-C--G-*C--G--C^ i 
Tey/ reaction used to optimize reagents and conditions for DNA-templated PNA coupling. 
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Aldolase PNAs attach 
IheiDselyes covalenfLy 
to solid sttpport 



Retroaldolase PNAs 
cleave ^lemselves from 
solid si^ort 



OH OH 



Two schemes for the selection of a biotin-terminated fimctionalized PNA capable of catalyzing 

an aldol or rettoaldol reaction. 
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^ r^o^ 

T T -T C \ 



A— A'-G- 



anseal, meastne rate of acylaiioi 



varyn, 

PNA ..y^-""^ 7 




T— T-T-C 

t I 1 I . 



Measuring the rate of reaction between a fixed nudeophile and an electrophile hybridized at 
varying distances along a DNA template defines an essential reaction window in which DNA- 
templated synthesis of nonpolymeric structures can take place. 
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Butocleavlrig linker 




autocleaving linker 

S^(CHiW^HC0(CH2)sNHR 

g J 4 . 

S 



yAgCOjCFa 

iyjH(CH2feOOHN<phyr^NH 

5 fampbte 

useful scar linker 

'^H*•^Bag8^l 



Y 5 



10 



temdi 8 Hi 8 H Vl; 5 OH " 

11 



^ pH 



i 



NHa 



NalO4.pH5.0 



9 O 



12 




I 1 



I 



unstained (dansylated 
species only . are vidble) 



Stained with ethldium bromide 




Hqm '<33 
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X mdecul cy productand 




unreadedtemplala 
Ventertfie next step 



' producti* 



UoSn * 



synlhesis usng an ^ 



autodeavlnglkiier 




+ blofln vnash 

tinted beads 




molecule ^ elutepiodudby 

tintercbav^ge 



product' 

mdecul) 



DN^tempbtedV produc t ^ 

synthesis using V V 

alracBlessihter \. 




a useful scar linker + ^^^^y^^^^ template 

'^'^^ Mdin-- 
* IhkBd bead^ 
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3 • -CCSX3CXXCCCCA6G'r»'^HBa-5 • 



1) EDC.Hilfo-NHS 

2) NH40H 



blotip 

3 « -CCGXXXXCCCGACOW^MB 



« i 



template pool 



3 • -OCaXXlOCCCCOAGCTT'^Re 

5 • -GC0X»atCCaXXXXCCCX»atC0CXXXX60GCTCC»A-3 • 



1 



1) Ken/ DNA pdymerase 

2) denature strands, putiify with 
avidin tnognettc beads 



3«-C0CXJ«CXGCCWCXXCG{n^^ 



35/57 



wo 02/074929 



PCT/US02/08546 



O n nagmtOtnty 

BDC.sulA>4«IS ^ ^ 



1) AC|0 (mppins) 

2) pH MS 



R< 0 



f) Hi O 

EDCsolfo-NBS 
2)Ac20,llienpHllJ 



l)pH 



2) IDC,HJlf(»-NHS 

3) Ac,0 



«« s. 
BDCsalfo-ffHS 
4)Ac,0^llien pHlLS 



If 



OyNH 



O 



Sirvefiira* ofrtagmnt Bbntks: 
S'.GCCXXXXCGC-Oiika^w add, 
ff^CCXXXXGCC-nnker-ofliliw add, 
S'-<;cC»XXXCCG-llnkePi«iBbe add^ 
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y.i»ag9fitoyg<K>^[j 0 ^ O {j | 



NHS-0' 
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basesZI-SO 

EDC.sulfo^S I " 

ONA-templatad I 

amide fbnnation | 

(stepi) I 



lenplalo'wvw^l 



1 



capture wfth avUin-Dnked 
beads, ehJtewRhpH 11.6 



14 8 

I anneal second nagent 



1}DMT-MM(step2) ' 
2} avidin beads, then 



pH 11.8 buffer I " 



ttnpiale^/VWW^I 

15 0 " 



anneal ihkd reagent 



a - " ' ' 

1 



OH 



^1 



1) OMT-MM (step 3) >P*^OH 

2) avldlnbeads.t})en 
pH 11.8 buffer 

" -OH 



t«n|*il«rvwwv*-| 

16 ^ 
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tasas 21-30 



«0-.|^ 17 



ONMempiated 
sfnUft fionnsUon 
(step 1,77%) 



0 

^ OH 
Ph 




1) DNA4emplaledWIUlg ^i^Ns^^uu^ 
dennaUoii(stdp2.66%) I PirvT N»Mon«j^ 

2) wfa8hwahavUinbeeds I 



I 




b 21 

anneal third reagent 




UoUn I PIT ' H 

I ONMemplaled conjugate 
addtUiin(itepa.7S%) 

Molta 0 Q,.0 o 




I capture with 
straptevldln bc8d8| 
ehdawOhpHliabulfer 



24 Rt°<CHd^Hi 
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BH— synthflUe Bbrary 



I CM « 



akOoccx wasTC Ka asxTC OCOBMCTT-HM-synmetic molecule 



IFCRwttfa S'-CGAGCiVGCACCAGCG^* and 
3>.GOGXXXXCCCGAGGT(bi'otln)T.NH-amiiio acfd 

3«-act U CT o aMBtc u cutBM g e«^OBg a BMB^^ material 
s*-ocMscK»AccMcencroomTco8CCToac9ccii^^ (inaivcopto; 



DNA-*cmpiated <ygUi«tb 



. DNA 
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t • - eaMCAioaiccMcocACTC cocct ooatc cocct ciiGGTOCxcacqxcT c CTxc oacet ccxfc- a • 



ttumleiawcotftng 3«-ocTC8TCOt«a»cocTCAOoocwACCCCTACooanct:cAcccocacTnAOOTtacccoAaoTT-a»--R 
5» -co*oc*oc*cc*ocoxoTccceccToooaiiTecccc«oi5TaQacocaJ»CTCcAxcoaocTccAA,- J • 

dtgat w«h MCI Md ifian 



OACCTASOCOIW TaAC0*TOCCC«OOt»-ill-R 
//•fi?f2?i?5JS?S5?SK?5?^^,, QATCCOCCCCCa CC»fcC0a0CTCC**.3' 
S'-CaMCACMCCAOCOABTCeCOCCtO wcCACCCflCOC 

•TTCoacacaACT TaAoaTT9cccaAoan'Rir~R 
<i%cccctACoaa ccaacoosctccaa-s* 

QOOAZOCCCCtM 

I T4DMAngn« 

J • -acTcaTCOTaaTcaeoTOAoacCQACcecTAeaafiuefCAeeeaeaRtaAaeAtoceeaAaaTT- bm—R 
s*«eaAocABCAccJUieacACTceaccTaaooAT6eecccMi^iaaGcacaACTeczAC«aoeTCCAA-3t 

daa^itartuaptatsa ^ ^ .qc»coTCOToa«caeTe*ooacoaACc»Aeficdiia«ctfc»coTqcaMaAOOTT«ccco*«otT-wi— B 
s * '>eoMCASCAccAoea AOTGecqs{;.TcaATccasQF c<jeot ocAoocnAct ccaac oooct ccaa- a • 
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HB^V^ ^ °S ■< 




R — y-thxMCTPboBiieelodde J-triphoip hn t B ' 
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'^^H 



Rr r-deoxyribose-y-triphosphate 



^Y"' ^COOH 
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k 

22 



pdCl^NaO Ac buffer. j^, 
N-^llyltrifluoFoacetamide 

58% 

R=2'-deo3cyribose 




R 
23 



1) POCb, proton sponge, 
mooQFj trimethyl phosphate ^ ni 

2) tri-n-butylanunoniuin 
pyrophosphate, DMF 




IH2 

R*=2*-deoxyribose 
S*-tripho^hate 



4S/S1 
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25 76 ^^'TTF^ NaOEtfEIOH "=^yS^" 

24 O 59% 

-ZZJL^ /^V*^NH ^ POCb >rV^ N-todosuccinlmide 

87% R 

30 



1) Raney Nl 



91 % 
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NH2 NHa NH2 NHi 

0 '-p^ 1^ .^i> ^ 

R . . R R R 

Ra^-deoKyrttioso 31 32-37 



Ri- 



^-deoxyrilNng 



H 



I 

R 

R'a2r-deoxyribose- 
94rtphosphalB 



a (R 3 Me): 1) HMDS, dloxane. 2) Me^Sn. Pd(PPha)<, NMPi 3) K2CO3. MeOH 

b (R » Et): 1} HMDS, dloxane, 2) EUSn. PdrPPhak UhSP, 3) K3CO3. MeOH 

c (R » CH2»CH2): 1) HMDS, (iioxane. 2} (CH2»C?^ Pd(PPIi3}4. NMP. 3} l^COs. MeOH 

d(R»NHM0):MeNHtH2O 

s(RaNHEl}:EtNH2, HjO 

f (R = hlstammyt): Mstamino, EtOH. heat 
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finite 52. 
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41 R* 42 R- 

R* = 2'-d eoxidbose-S'TlriDhosohate 
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20 or 40 random bases 
8 * -ACGtMCCgCgTCCCII M IIWI MmiHM II in n u reilingrCCGTCAtCOAOCCCT'' 3 ' 

synOwtietiBmplatBUbnry 



3 '-aGCAGTAGCZCGGGAS^^Bs-S • 
S * -^ACGCASCGOeCtCGC- 3 * 



DNA polymerase 
dCTP.dGTP 
* » npiMuitaril dAn'P or d ATP 
nadeotlde dUTP or dTTP 



a • -ATGCATCGCCQCAgCGlltm fiM M ilMIIMMSH& aHll i l MI I O OCAgTACCTCSOgAT*MIH,»S 
S • -TACGTAGCGCCCTCGC M M M M M !nilMHinnillWnnnillCC8TCA*CSAfiCCgtA-3 * 
I * * * «• 

blatin 

denature strands, remove 
ttndcsired strand with 
avklin magnetic beads 



3 • -AI«CATC6CCaCJl6C6m»8 



A6CTcooaAX<^nra,-5 * 
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o 



Band Fonmtton Sdecton I | 

Rbraiy 



^t^&yi^^. ] »iib«tratfc-W<mn 
divecd^ loto next | I ^ 



capture acdve ^ 



catalysis with ^Jz^'^ 
j^jfo immobiUzed HM-^ 

^^J^produc-Wblin . ^ 



O jkpt aceneaialyst 
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